Gemini Exponential, Demis Hassabis’ ‘Proto-AGI’ coming, but …

Gemini Exponential, Demis Hassabis’ ‘Proto-AGI’ coming, but …

More

Descriptions:

AI Explained condenses roughly 10 hours of Google DeepMind interviews and model release coverage into a focused analysis of Gemini 3 Flash, released in December 2025. The video leads with benchmark data showing Gemini 3 Flash outperforming the prior state-of-the-art Gemini 2.5 Pro across mathematics, coding, and visual reasoning — including a jump from 88% to 95.2% on the AIM mathematics benchmark — despite being the faster, lower-latency variant of the model family.

Beyond the headline numbers, the video raises a critical and often-overlooked issue: Gemini 3 Flash rarely admits uncertainty. Of questions it answers incorrectly, 91% result in hallucinated responses rather than “I don’t know” — compared to a roughly 50/50 split for GPT models. This connects to a broader point about how AI labs currently incentivize models to always attempt an answer, which OpenAI called an “epidemic” in a September 2025 paper advocating for rewarding calibrated uncertainty instead.

The video closes with Demis Hassabis outlining his vision for a “proto-AGI” that would unify Gemini 3, the Genie 3 world-simulation model, the Simmer 2 gaming agent, and Nano Banana Pro image generation into a single integrated system. The host tempers that optimism by pointing to current limitations in physical-world understanding within these models, noting that even basic Newtonian mechanics remain approximations rather than reliable simulations.


📺 Source: AI Explained · Published December 19, 2025
🏷️ Format: News Analysis

1 Item

Channels

2 Items

Companies

1 Item

People