Artificial Intelligence, Research

A curated list of the latest breakthroughs in AI by release date with a clear video explanation, link to a more in-depth article, and code

Photo by Kelly Sikkema on Unsplash

Even with everything that happened in the world this year, we still had the chance to see a lot of amazing research come out, especially in the field of artificial intelligence. Moreover, many important aspects were highlighted this year, such as ethical aspects, important biases, and much more. Artificial intelligence and our understanding of the human brain and its link to AI are constantly evolving, showing promising applications in the near future.

Here are the most interesting research papers of the year, in case you missed any of them. In short, it is basically a curated list of the latest…

Computer Vision

Just imagine how cool it would be to just take a picture of an object and have it in 3D to insert in the movie or video game you are creating or in a 3D scene for an illustration.


Neural scene representation from a single image is a really complex problem. The “end goal” is to be able to take a picture of a real-life object and translate it into a 3D scene. This implies that the model understands a whole 3-dimensional, real-life scene using information from a single picture. This can sometimes be hard even for humans, since the colors or shadows in an image can trick our eyes.

Artificial Intelligence, Innovation

It is both very clever and simple and you could use this same model for many image classification applications.

FMMLs [1]

FMMLs

Odei Garcia-Garin et al. from the University of Barcelona have developed a deep learning-based algorithm able to detect and quantify floating garbage in aerial images. They also made a web-oriented application that lets users identify this garbage, called floating marine macro-litter, or FMML, within images of the sea surface. Floating marine macro-litter is any persistent, manufactured, or processed solid material lost or abandoned in a marine compartment. As you most certainly know, this plastic waste is dangerous for fish, turtles, and marine mammals, as they can either ingest it or get entangled and hurt.

How to get rid of FMMLs?

Artificial Intelligence, Research

The 3 most interesting AI papers this month with video demos, short articles, code, and paper reference.


Here are the 3 most interesting research papers of the month, in case you missed any of them. It is a curated list of the latest breakthroughs in AI and Data Science by release date with a clear video explanation, link to a more in-depth article, and code (if applicable). Enjoy the read, and let me know if I missed any important papers in the comments, or by contacting me directly on LinkedIn!

Follow me on Medium to see this AI top 3 every month!

Paper #1:

DALL·E: Generate Images from Text Captions! Inspired by GPT-3 and Image-GPT from OpenAI [1]

OpenAI successfully trained a network able to generate images from text captions. …

Computer Vision

TL;DR: They combined the efficiency of GANs and convolutional approaches with the expressivity of transformers to produce a powerful and time-efficient method for semantically guided, high-quality image synthesis.

If the title and subtitle sound like another language to you, this article was made for you!


Image-GPT

You’ve probably heard of iGPT, or Image-GPT, recently published by OpenAI, which I covered on my channel. It is the state-of-the-art generative transformer model. OpenAI applied the transformer architecture to a pixel representation of images to perform image synthesis. In short, they feed half the pixels of an image to a transformer, which then generates the other half of the image. As you can see here, it is extremely powerful.
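
The half-image completion idea can be sketched in a few lines of Python. This is only a toy illustration, not OpenAI's code: `complete_image` and `copy_pixel_above` are hypothetical names, and the trained transformer is replaced by a trivial stand-in predictor that simply copies the pixel one row up. What it does show is the generation loop itself: keep the first half of the pixels in raster order, then produce the missing pixels one at a time, each conditioned on everything generated so far.

```python
from typing import Callable, List, Sequence

# A "predictor" maps the pixels seen so far to the next pixel value.
# In a real system this would be a trained transformer; here it is a stand-in.
Predictor = Callable[[Sequence[int]], int]


def complete_image(pixels: Sequence[int], predict_next: Predictor) -> List[int]:
    """Keep the first half of a raster-ordered image and generate the rest.

    Pixels are generated one at a time, and each new pixel is appended to the
    context before the next one is predicted (autoregressive generation).
    """
    half = len(pixels) // 2
    sequence = list(pixels[:half])  # the visible half given as input
    while len(sequence) < len(pixels):
        sequence.append(predict_next(sequence))
    return sequence


def copy_pixel_above(width: int) -> Predictor:
    """Toy predictor (an assumption for illustration): repeat the pixel one row up.

    Requires the context to contain at least one full row of `width` pixels.
    """
    def predict(context: Sequence[int]) -> int:
        return context[-width]
    return predict


if __name__ == "__main__":
    # A 4x4 "image" flattened row by row; only the top two rows are kept.
    image = list(range(16))
    completed = complete_image(image, copy_pixel_above(width=4))
    for row in range(4):
        print(completed[row * 4:(row + 1) * 4])
```

Swapping `copy_pixel_above` for a learned model that outputs a distribution over pixel values, and sampling from it, is what would turn this loop into an actual generative model.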

Computer Vision, Deep Learning

Google used a modified StyleGAN2 architecture to create an online fitting room where you can automatically try on any pants or shirts you want using only an image of yourself.


VOGUE: Try-On by StyleGAN Interpolation Optimization [1]

A team of researchers from Google, MIT, and the University of Washington recently published a paper called “VOGUE: Try-On by StyleGAN Interpolation Optimization”. They use a GAN architecture to create an online fitting room, where you can automatically try on any pants or shirts you want using only an image of yourself. Also called garment transfer, the goal is to take the clothes from a person in a picture and transfer them onto someone else while preserving the correct body shape, hair, and skin color. …

Artificial Intelligence, Neuroscience

Drawing inspiration from Human Capabilities Towards a more general and trustworthy AI & 10 Questions for the AI Research Community.


Table of contents

Towards more general and trustworthy AI

I will let Francesca Rossi introduce this article with her great remark made at the AI Debate 2 organized by Montreal AI:

These are the reasons why Francesca Rossi and her team at IBM published this paper proposing a research direction to advance AI, drawing inspiration from cognitive theories of human decision making. …

Computer Vision, Natural Language Processing

OpenAI successfully trained a network able to generate images from text captions. It is very similar to GPT-3 and Image GPT and produces amazing results.

DALL-E is a new neural network developed by OpenAI based on GPT-3. In fact, it’s a smaller version of GPT-3, with 12 billion parameters instead of 175 billion. But it has been specifically trained to generate images from text descriptions, using a dataset of text-image pairs instead of a very broad dataset like GPT-3's. It can create images from natural-language text captions, just like GPT-3 creates websites and stories.

Image via https://openai.com/blog/dall-e/

It’s a continuation of Image GPT and GPT-3, both of which I covered in previous videos, in case you haven’t watched them yet.

DALL-E is very similar to GPT-3…

Computer Vision, Research

The top 10 computer vision papers in 2020 with video demos, articles, code, and paper reference.


Even with everything that happened in the world this year, we still had the chance to see a lot of amazing research come out, especially in the field of artificial intelligence and, more precisely, computer vision. Moreover, many important aspects were highlighted this year, such as ethical aspects, important biases, and much more. Artificial intelligence and our understanding of the human brain and its link to AI are constantly evolving, showing promising applications in the near future, which I will definitely cover.

Here are my top 10 of the most interesting research papers of the year in computer vision, in…

This new method can generate a complete 3-dimensional scene and lets you choose the lighting of the scene, all with very limited computation costs and amazing results compared to previous approaches.

Image via: P. P. Srinivasan et al., “NeRV: Neural Reflectance and Visibility Fields for Relighting and View Synthesis”

NeRV, or Neural Reflectance and Visibility Fields for Relighting and View Synthesis, is a method that produces a 3D representation of a scene and can generate arbitrary lighting conditions. It only needs a set of images of the scene as inputs to generate novel viewpoints of the scene under any chosen lighting conditions!

Louis (What’s AI) Bouchard

I explain Artificial Intelligence terms and news to non-experts. Master's student, AI Research Scientist, and YouTube speaker. https://www.youtube.com/c/WhatsAI
