⚡️ Google’s Open AI Strategy — Omar Sanseviero, Google DeepMind

Description:

In this Latent Space podcast interview, Omar Sanseviero from Google DeepMind walks through the technical decisions and strategic thinking behind Gemma 4 and Google’s broader open model program, covering architecture, deployment partnerships, and evolving fine-tuning trends.

Sanseviero explains the ‘effective parameters’ approach used in Gemma 4’s smaller variants. A per-layer embedding table lets a model with nearly 5 billion total parameters load only 2 billion into GPU memory, while the remainder can be offloaded to CPU or disk. The design specifically targets on-device inference on Android phones and Raspberry Pi. The 29B and 31B Gemma 4 models use a different architecture for larger deployments, and Sanseviero notes that experiments with scaling the per-layer embedding approach are ongoing. He also describes coordinating the Gemma 4 launch with approximately 50 external partners, including llama.cpp, Ollama, vLLM, Hugging Face, Nvidia, and AMD, as well as an Android Studio integration that runs Gemma 4 locally for offline coding assistance.
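
The memory split described above can be made concrete with some rough arithmetic. The sketch below is illustrative only: the parameter counts (about 5 billion total, 2 billion resident) come from the description, while the `footprint` helper, the `BYTES_PER_PARAM` table, and the choice of precisions are assumptions introduced here, not details from the interview. It counts weights only and ignores activations, KV cache, and runtime overhead.

```python
from dataclasses import dataclass

# Assumed storage cost per parameter for a few common precisions.
BYTES_PER_PARAM = {"fp32": 4.0, "bf16": 2.0, "int8": 1.0, "int4": 0.5}

GIB = 1024 ** 3


@dataclass(frozen=True)
class Footprint:
    resident_gib: float   # weights kept in accelerator memory
    offloaded_gib: float  # weights that could live on CPU or disk


def footprint(total_params: float, resident_params: float, dtype: str) -> Footprint:
    """Estimate weight memory for a model that keeps only part of its
    parameters in accelerator memory (hypothetical helper)."""
    if resident_params > total_params:
        raise ValueError("resident_params cannot exceed total_params")
    bytes_per_param = BYTES_PER_PARAM[dtype]
    resident = resident_params * bytes_per_param / GIB
    offloaded = (total_params - resident_params) * bytes_per_param / GIB
    return Footprint(resident_gib=resident, offloaded_gib=offloaded)


if __name__ == "__main__":
    # Figures from the description: ~5B total parameters, ~2B loaded.
    for dtype in BYTES_PER_PARAM:
        fp = footprint(total_params=5e9, resident_params=2e9, dtype=dtype)
        print(f"{dtype}: resident {fp.resident_gib:.2f} GiB, "
              f"offloadable {fp.offloaded_gib:.2f} GiB")
```

Under these assumptions, the offloadable share is 1.5 times the resident share at any precision, which is why moving it out of accelerator memory matters on phones and single-board computers.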

The conversation also covers shifting fine-tuning trends: many partners who planned to fine-tune Gemma 4 found that the base model performed well enough out of the box. Other topics include early research into diffusion-based text generation models, how Gemma Nano powers on-device Gemini features on Pixel and high-end Samsung devices, and why Google sees open models as central to its long-term AI platform strategy.

📺 Source: Latent Space · Published May 24, 2026
🏷️ Format: Interview

Companies

DeepMind

Tags

AMD, Android, DeepMind, Gemini, Gemma 4, Hugging Face, llama.cpp, Nvidia, Ollama, Samsung, vLLM
