Captaining IMO Gold, Deep Think, On-Policy RL, Feeling the AGI in Singapore — Yi Tay

Interviews5 months ago

Captaining IMO Gold, Deep Think, On-Policy RL, Feeling the AGI in Singapore — Yi Tay

Descriptions:

Yi Tay, a leading researcher at Google DeepMind who heads the Reasoning & AGI team in Singapore, joins the Latent Space podcast for a wide-ranging conversation on frontier AI research. Having returned to GDM after 18 months at Reka, Tay describes how he transitioned from years of architecture and pretraining work into reinforcement learning — now the dominant modeling paradigm for capable reasoning systems like Deep Think, GDM’s model that achieved gold at the International Mathematical Olympiad.

The discussion covers why on-policy RL has become the central lever for improving reasoning, and Tay’s observation that fundamental research skills transfer surprisingly well across paradigms. He also shares how his own use of AI coding tools has evolved dramatically — from occasionally generating matplotlib plots, to now delegating debugging entirely without even reading error logs, a mode he describes as ‘vibe ML’ or ‘vibe training.’

The episode covers the naming of GDM’s Singapore team, the cultural moment of Amazon, Google, and Meta all signaling AGI intent through team branding, and what it felt like to return to Google infrastructure after an extended absence. Tay offers candid insider perspective on how a top AI lab approaches the reasoning frontier and why RL has become the lingua franca of model development.

📺 Source: Latent Space · Published January 23, 2026
🏷️ Format: Interview

1 Item

Companies

No Image Available

DeepMind

Tags

DeepMind Gemini YouTube

Prev

Build AI SaaS Apps for Free Using n8n and Google Firebase studio(Step-by-Step Tutorial)

Build AI SaaS Apps for Free Using n8n and Google Firebase studio(Step-by-Step Tutorial)

Next

7 NEW NotebookLM Use Cases for February 2026

7 NEW NotebookLM Use Cases for February 2026

18 Related Posts

Related Posts

07:36

Interviews

Microsoft Shifts Strategy on Enterprise AI

2 days ago

02:00:20

Interviews

Claude Fable 5 Is BACK (And It’s Different)

2 days ago

01:18:07

Interviews

Coinbase Cuts AI Spend by 50% | Kalshi’s $40B Valuation & Impending IPO | The Year for SaaS Roll-Ups

2 days ago

44:07

Interviews

Tesla Deliveries Jump 25% | Bloomberg Tech 7/02/2026

2 days ago

05:14

Interviews

Nuclear Reactor Powers Nvidia AI Chip in US First

2 days ago

01:24:35

Interviews

ARC-AGI-3 Explained by the Team That’s Winning It

3 days ago