Google’s New AI Is Smarter Than Everyone’s But It Costs HALF as Much

Business & Strategy3 months ago

Google’s New AI Is Smarter Than Everyone’s But It Costs HALF as Much

Descriptions:

Nate B Jones examines the release of Google’s Gemini 3.1 Pro — a model that leads on 13 of 16 major benchmarks, scores 77.1% on the ARC-AGI2 novel reasoning test (more than doubling Gemini 3 Pro’s score of 31.1% from just 90 days earlier), and is priced at roughly one-seventh the cost of Anthropic’s Claude Opus 4.6. Jones argues that the benchmark numbers, while real, are not the actual story — and that the pricing strategy reveals something more strategically significant about where Google is headed.

The video draws a sharp contrast between how the three leading AI labs have chosen to optimize their frontier models. Anthropic built Opus 4.6 for sustained agentic coding — multi-agent coordination across codebases over days or weeks. OpenAI built Codex 5.3 for specialized coding pipelines with self-bootstrapping sandboxes and high-throughput inference. Google built Gemini 3.1 Pro specifically for deep reasoning on genuinely novel problems — the ARC-AGI2 benchmark explicitly tests logic problems the model has never seen, making retrieval from training data useless. Jones connects this design choice directly to DeepMind CEO Demis Hassabis’s 15-year mission statement: “solve intelligence first, then use it to solve everything else.”

The analysis then becomes a practical framework for how individuals and organizations should actually choose between frontier models. Jones argues that “hard” is not a single thing — problems can be hard because they require deep reasoning (Gemini 3.1 Pro’s strength), because they are large in scale but cognitively simple (effort-bottlenecked), or because they involve ambiguity and judgment where human context is irreplaceable. Matching model capability to problem type, rather than chasing the highest overall benchmark rank, is the more useful decision framework for anyone deploying AI in real workflows.

📺 Source: Nate B Jones · Published February 23, 2026
🏷️ Format: News Analysis

1 Item

Companies

No Image Available

Google

Tags

Prev

Anthropic Tested 16 Models. Instructions Didn’t Stop Them

Anthropic Tested 16 Models. Instructions Didn’t Stop Them

Next

the SCARIEST chart in AI

the SCARIEST chart in AI

18 Related Posts

Related Posts

24:56

Business & Strategy

everyone JUST got HACKED…

1 hour ago

33:09

Business & Strategy

AI News: Impressive New Model From Unexpected Company

1 hour ago

18:27

Business & Strategy

Combine Skills and MCP to Close the Context Gap — Pedro Rodrigues, Supabase

1 hour ago

06:46

Business & Strategy

The trial of the century is even dumber than expected…

1 hour ago

41:05

Business & Strategy

Anthropic on USA vs China

1 hour ago

44:03

Business & Strategy

Cerebras Goes Public in Year’s Biggest IPO | Bloomberg Tech 5/14/2026

1 day ago