Your Daily AI Research tl;dr — 2022–09–14 🧠
A Stable Diffusion GUI App for M1 Macs, Controllable 3D-Aware Portrait Generation, and Multi-Lingual Visual Question Answering…
Welcome to your official daily AI research tl;dr (often with code and news) for AI professionals. Each day I share the most exciting papers I find, along with a one-liner summary to help you quickly decide whether the paper (and its code) is worth investigating.
1️⃣ Explicitly Controllable 3D-Aware Portrait Generation
“We propose a 3D portrait generation network that produces 3D consistent portraits while being controllable according to semantic parameters regarding pose, identity, expression and lighting.”
Link to the paper: https://arxiv.org/pdf/2209.05434.pdf
2️⃣ Towards Multi-Lingual Visual Question Answering
“We propose scalable solutions to multilingual visual question answering (mVQA), on both data and modeling fronts” mainly by proposing “a translation-based framework to mVQA data generation that requires much less human annotation efforts than the conventional approach of directly collection questions and answers.”
Link to the paper: https://arxiv.org/pdf/2209.05401.pdf