Your Daily AI Research tl;dr: 2022-10-18 🧠
Google’s new AI can hear a snippet of a song and then keep on playing, a framework enabling research on hour-long videos, and a Transformer for 3D object detection!
Welcome to your official daily AI research tl;dr (often with code and news) for AI professionals. I share the most exciting papers I find each day, each with a one-liner summary to help you quickly decide whether the paper (and its code) is worth investigating.
1️⃣ Compressed Vision for Efficient Video Understanding
“We propose a framework enabling research on hour-long videos with the same hardware that can now process second-long videos.”
Link to the paper: https://arxiv.org/pdf/2210.02995.pdf
2️⃣ SWFormer: Sparse Window Transformer for 3D Object Detection in Point Clouds
“We propose Sparse Window Transformer (SWFormer), a scalable and accurate model for 3D object detection, which can take full advantage of the sparsity of point clouds.”
Link to the paper: https://arxiv.org/pdf/2210.07372.pdf