Member-only story

Your Daily AI Research tl;dr — 2022–08–15 🧠

Useful guidance in using DALL-E 2, Deepmind’s Transframer and synthesis of musical instruments sounds.

Louis-François Bouchard
2 min readAug 15, 2022
Llama playing basketball, generated using DALL·E 2 by Joy Zhang.

Welcome to your official daily AI research tl;dr (often with code and news) for AI professionals where I share the most exciting papers I find daily, along with a one-liner summary to help you quickly determine if the article (and code) is worth investigating.

1️⃣ Transframer: Arbitrary Frame Prediction with Generative Models [deepmind paper]

Transframer: A general-purpose framework for image modeling and vision tasks based on probabilistic frame prediction, combining U-Net and Transformer components.

Link to the paper: https://arxiv.org/abs/2203.09494

2️⃣ DDX7: DIFFERENTIABLE FM SYNTHESIS OF MUSICAL INSTRUMENT SOUNDS

“We present Differentiable DX7 (DDX7), a lightweight architecture for neural FM resynthesis of musical instrument sounds in terms of a compact set of parameters.”

What does this mean?

“We train the model on instrument samples extracted from the URMP dataset, and quantitatively demonstrate its comparable audio…

--

--

Louis-François Bouchard
Louis-François Bouchard

Written by Louis-François Bouchard

I try to make Artificial Intelligence accessible to everyone. Ex-PhD student, AI Research Scientist, and YouTube (What’s AI). https://www.louisbouchard.ai/

No responses yet