When AI Discovers the Next Transformer — Robert Lange


Description:

Robert Lange, a founding researcher at Sakana AI, joins Machine Learning Street Talk host Tim to discuss Shinka Evolve — a new paper that extends the evolutionary LLM program synthesis approach pioneered by AlphaEvolve. The conversation covers how language models can iteratively generate, mutate, and evaluate programs: Upper Confidence Bound (UCB) selection chooses which parent programs to mutate next, mutable code markers confine edits to designated regions so that critical imports and evaluation scaffolding are protected from unintended changes, and rejection sampling with reflection enforces structural constraints on generated code.
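To make the UCB selection step concrete, here is a minimal sketch of UCB1-based parent selection over an archive of candidate programs. The function names, archive layout, and exploration constant are illustrative assumptions, not the actual Shinka Evolve implementation.

```python
import math

def ucb_score(mean_reward, visits, total_visits, c=1.0):
    """Classic UCB1: balance exploiting high-scoring programs
    against exploring rarely-sampled ones."""
    if visits == 0:
        return float("inf")  # always try unvisited candidates first
    return mean_reward + c * math.sqrt(math.log(total_visits) / visits)

def select_parent(archive):
    """Pick the program with the highest UCB score.
    archive: list of dicts with 'mean_reward' and 'visits' fields."""
    total = sum(p["visits"] for p in archive) or 1
    return max(archive, key=lambda p: ucb_score(p["mean_reward"], p["visits"], total))

# Hypothetical archive: prog_a exploits best, but the bonus term
# steers selection toward the barely-explored prog_c.
archive = [
    {"id": "prog_a", "mean_reward": 0.80, "visits": 10},
    {"id": "prog_b", "mean_reward": 0.75, "visits": 2},
    {"id": "prog_c", "mean_reward": 0.60, "visits": 1},
]
parent = select_parent(archive)  # prog_c: exploration bonus dominates
```

The key design point is that the exploration bonus shrinks as a program accumulates evaluations, so the search naturally revisits promising but under-sampled branches of the program tree.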

A central theme is sample efficiency: while similar systems may sample thousands of programs per task, Shinka Evolve targets comparable performance with far fewer evaluations. Lange explains the information representation challenge — compressing evaluation histories to fit within context windows — and discusses future directions such as multi-file codebase mutation, repository maps inspired by the Aider coding tool, and the potential for active fine-tuning during evolutionary runs.

The broader conversation touches on Sakana AI’s research philosophy, shaped by CEO David Ha and inspired by Ken Stanley’s open-endedness framework. Lange reflects on what it would mean for AI systems to autonomously discover architectural innovations as significant as the Transformer itself — framing current LLM-driven search as a step toward that Rubicon. The episode also briefly covers NVIDIA GTC 2026, the leaked Nemo Claw open-source agent platform, and the general challenge of verifying program correctness versus generating candidate solutions.


📺 Source: Machine Learning Street Talk · Published March 13, 2026
🏷️ Format: Interview
