Your Daily AI Research tl;dr: 2022-08-15 🧠
Useful guidance on using DALL-E 2, DeepMind’s Transframer, and the synthesis of musical instrument sounds.
Welcome to your official daily AI research tl;dr (often with code and news) for AI professionals. I share the most exciting papers I find each day, each with a one-line summary to help you quickly decide whether the paper (and its code) is worth investigating.
1️⃣ Transframer: Arbitrary Frame Prediction with Generative Models [DeepMind paper]
Transframer: A general-purpose framework for image modeling and vision tasks based on probabilistic frame prediction, combining U-Net and Transformer components.
Link to the paper: https://arxiv.org/abs/2203.09494
2️⃣ DDX7: Differentiable FM Synthesis of Musical Instrument Sounds
“We present Differentiable DX7 (DDX7), a lightweight architecture for neural FM resynthesis of musical instrument sounds in terms of a compact set of parameters.”
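For readers new to FM (frequency modulation) synthesis, here is a minimal sketch of a two-operator FM tone in plain Python. This is not the DDX7 model or its code. The function name `fm_tone` and its parameters are illustrative assumptions, chosen only to show how a handful of numbers (carrier frequency, frequency ratio, modulation index) can shape a sound.

```python
import math

def fm_tone(carrier_hz, ratio, mod_index, duration_s=0.5, sample_rate=16000):
    """Two-operator FM: one sine modulator drives the phase of one sine carrier.

    All parameter names are hypothetical, chosen for this illustration.
    """
    n_samples = int(duration_s * sample_rate)
    samples = []
    for i in range(n_samples):
        t = i / sample_rate
        # The modulator runs at a fixed ratio of the carrier frequency.
        modulator = math.sin(2 * math.pi * carrier_hz * ratio * t)
        # mod_index controls how strongly the modulator bends the carrier phase.
        samples.append(math.sin(2 * math.pi * carrier_hz * t + mod_index * modulator))
    return samples

# Three numbers describe the whole timbre of this toy tone.
tone = fm_tone(carrier_hz=440.0, ratio=2.0, mod_index=1.5)
```

Setting `mod_index` to 0 gives a pure sine wave, and raising it adds overtones. That is the sense in which a small parameter set can describe a rich timbre.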
What does this mean?
“We train the model on instrument samples extracted from the URMP dataset, and quantitatively demonstrate its comparable audio…