How Google DeepMind Runs Agents at Scale — KP Sawhney & Ian Ballantyne, Google DeepMind

Foundation Models2 months ago

How Google DeepMind Runs Agents at Scale — KP Sawhney & Ian Ballantyne, Google DeepMind

Descriptions:

Google DeepMind software engineer KP Sawhney and developer relations engineer Ian Ballantyne take the stage at the AI Engineer conference to walk through how DeepMind designs and scales agentic systems in production. The talk centers on Antigravity, DeepMind’s internal Visual Studio-style IDE that bundles a full agent manager framework, allowing developers to spawn and coordinate multiple agents across projects with built-in planning, browser control, DOM inspection, screenshot capture, and human-in-the-loop feedback at each step.

Sawhney, who previously built DeepMind’s Deep Research agent (now available via the Interactions API), details the platform team’s current engineering focus: scaling agentic workflows across DeepMind’s large monorepo and generalizing the Antigravity harness to broader use cases. He covers multi-model routing strategies — using lightweight, quota-free models like Gemma 4 for cost-sensitive subtasks while reserving more capable models for critical reasoning steps — as well as evaluation design for complex agentic pipelines, including mock-TPU environments that let teams test harness logic without burning real compute hours.

The conversation rounds out with the hard operational problem of resource fairness: how to prevent power users from starving shared infrastructure by spinning up large fleets of parallel agents. Sawhney acknowledges the current approach is essentially brute-force quota enforcement, and frames this as a bellwether for broader open questions about how token-hungry agentic systems will ultimately be priced — pointing to Anthropic’s recent moves around subscription limits as an early indicator of where the industry is heading.

📺 Source: AI Engineer · Published May 24, 2026
🏷️ Format: Deep Dive

1 Item

Channels

No Image Available

AI Engineer

1 Item

Companies

No Image Available

DeepMind

Tags

Anthropic Antigravity Deep Research DeepMind Gemini Gemini Flash Gemma 4 GitHub Google MCP TPU

Prev

SpaceX’ $75B+ Historic IPO, GPT5.5 Outperforms Polymarket, AI Solves 80yr old math problem | EP #257

Next

Master LTX Director: The Ultimate Timeline Control for AI Video| Multi-Frame Reference

18 Related Posts

Related Posts

21:09

Foundation Models

Persona Engineering: A Field Guide to AI Synthetic Personas — Ishan Anand, InsightSciences.ai

1 day ago

21:39

Foundation Models

Serving 2 Million Models Without Melting: Scaling the Hugging Face Hub — Arek Borucki, Hugging Face

2 days ago

06:40

Foundation Models

AMD Releases First Ever AI model: Instella-MoE-16B-A3B-Think

2 days ago

24:01

Foundation Models

US AI Dominance Is Over: Here’s Why

3 days ago

17:31

Foundation Models

The Messy Reality of Scale: Synthetic Data and Pre-Training — Marah Abdin & Robert McHardy, poolside

4 days ago

20:24

Foundation Models

From Agent Traces to Agent Simulations — Rustem Feyzkhanov, Snorkel AI

5 days ago