Gemini Exponential, Demis Hassabis’ ‘Proto-AGI’ coming, but …

Business & Strategy · 7 months ago

Description:

AI Explained condenses roughly 10 hours of Google DeepMind interviews and model release coverage into a focused analysis of Gemini 3 Flash, released in December 2025. The video leads with benchmark data showing Gemini 3 Flash outperforming the prior state-of-the-art Gemini 2.5 Pro across mathematics, coding, and visual reasoning, including a jump from 88% to 95.2% on the AIME mathematics benchmark, even though it is the faster, lower-latency variant of the model family.

Beyond the headline numbers, the video raises a critical and often-overlooked issue: Gemini 3 Flash rarely admits uncertainty. On questions it fails to answer correctly, it gives a hallucinated answer 91% of the time rather than saying “I don’t know”, compared to a roughly 50/50 split for GPT models. This connects to a broader point about how AI labs currently incentivize models to always attempt an answer, a practice OpenAI called an “epidemic” in a September 2025 paper that argued for rewarding calibrated uncertainty instead.
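To make the 91% figure concrete, here is a minimal Python sketch of how such a rate could be computed, assuming each response is labeled `correct`, `incorrect`, or `abstained`. The labels, the function names, the example counts, and the penalty-based score are all illustrative assumptions, not the benchmark's or OpenAI's actual scoring code. It shows why plain accuracy cannot distinguish a model that guesses from one that abstains, while a score that penalizes wrong answers can.

```python
from collections import Counter


def hallucination_rate(outcomes):
    """Share of not-correct responses that were wrong answers rather than abstentions."""
    counts = Counter(outcomes)
    not_correct = counts["incorrect"] + counts["abstained"]
    if not_correct == 0:
        return 0.0
    return counts["incorrect"] / not_correct


def penalized_score(outcomes, wrong_penalty=1.0):
    """Hypothetical score: +1 per correct answer, -wrong_penalty per wrong answer, 0 for abstaining."""
    if not outcomes:
        raise ValueError("outcomes must not be empty")
    counts = Counter(outcomes)
    return (counts["correct"] - wrong_penalty * counts["incorrect"]) / len(outcomes)


# Made-up counts chosen only to mirror the 91% vs. roughly 50/50 contrast in the text.
always_guesses = ["correct"] * 100 + ["incorrect"] * 91 + ["abstained"] * 9
often_abstains = ["correct"] * 100 + ["incorrect"] * 50 + ["abstained"] * 50

for name, outcomes in [("always_guesses", always_guesses), ("often_abstains", often_abstains)]:
    accuracy = Counter(outcomes)["correct"] / len(outcomes)
    print(name, accuracy, hallucination_rate(outcomes), penalized_score(outcomes))
```

Both hypothetical models have the same accuracy (0.5), so an accuracy-only leaderboard ranks them equally. Under the penalized score, the model that abstains more often comes out ahead, which is the kind of incentive shift the paragraph above describes.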

The video closes with Demis Hassabis outlining his vision for a “proto-AGI” that would unify Gemini 3, the Genie 3 world-simulation model, the SIMA 2 gaming agent, and Nano Banana Pro image generation into a single integrated system. The host tempers that optimism by pointing to these models' current limits in physical-world understanding, noting that their handling of even basic Newtonian mechanics remains an approximation rather than a reliable simulation.

📺 Source: AI Explained · Published December 19, 2025
🏷️ Format: News Analysis

Channels: AI Explained

Companies: DeepMind, OpenAI

People: Demis Hassabis

Tags: ChatGPT, Claude, DeepMind, Demis Hassabis, Gemini 3 Flash, Genie 3, GPT-5.1, GPT-5.2, Greg Brockman, OpenAI, Sam Altman
