Your Daily AI Research tl;dr: 2022-08-26 🧠
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation, a free AI writing assistant built on top of OpenAI’s GPT-3, and Video Mobile-Former…
Welcome to your official daily AI research tl;dr (often with code and news) for AI professionals. Each day I share the most exciting papers I find, each with a one-liner summary to help you quickly decide whether the paper (and its code) is worth investigating.
1️⃣ DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation
“With just a few images (typically 3–5) of a subject (left), DreamBooth — our AI-powered photo booth — can generate a myriad of images of the subject in different contexts (right), using the guidance of a text prompt.”
Link to the paper: https://arxiv.org/pdf/2208.12242.pdf
2️⃣ Video Mobile-Former: Video Recognition with Efficient Global Spatial-temporal Modeling
“We extend Mobile-Former to Video Mobile-Former, which decouples the video architecture into a lightweight 3D-CNNs for local context modeling…