Member-only story

Your Daily AI Research tl;dr — 2022–10–28 🧠

I was featured in the “Who’s Who in artificial intelligence” report! (& very cool large language models papers)

2 min readOct 28, 2022

Welcome to your official daily AI research tl;dr (often with code and news) for AI professionals where I share the most exciting papers I find daily, along with a one-liner summary to help you quickly determine if the article (and code) is worth investigating.

Receive your daily update right in your inbox ⬇️

1️⃣ Large language models are not zero-shot communicators

Humans intuitively understand the response “I wore gloves” to the question “Did you leave fingerprints?” as meaning “No”. They design a simple task and evaluate widely used state-of-the-art models to figure out if large language models can understand implicatures (the type of inference mentioned above).

Link to the paper: https://arxiv.org/abs/2210.14986

2️⃣ What Language Model to Train if You Have One Million GPU Hours?

“We perform an ablation study at the billion-parameter scale comparing different modeling practices and their impact on zero-shot generalization.”

Your Daily AI Research tl;dr — 2022–10–28 🧠

I was featured in the “Who’s Who in artificial intelligence” report! (& very cool large language models papers)

1️⃣ Large language models are not zero-shot communicators

2️⃣ What Language Model to Train if You Have One Million GPU Hours?

Written by Louis-François Bouchard

No responses yet