Google’s New AI Is Smarter Than Everyone’s But It Costs HALF as Much

Google’s New AI Is Smarter Than Everyone’s But It Costs HALF as Much

More

Descriptions:

Nate B Jones examines the release of Google’s Gemini 3.1 Pro — a model that leads on 13 of 16 major benchmarks, scores 77.1% on the ARC-AGI2 novel reasoning test (more than doubling Gemini 3 Pro’s score of 31.1% from just 90 days earlier), and is priced at roughly one-seventh the cost of Anthropic’s Claude Opus 4.6. Jones argues that the benchmark numbers, while real, are not the actual story — and that the pricing strategy reveals something more strategically significant about where Google is headed.

The video draws a sharp contrast between how the three leading AI labs have chosen to optimize their frontier models. Anthropic built Opus 4.6 for sustained agentic coding — multi-agent coordination across codebases over days or weeks. OpenAI built Codex 5.3 for specialized coding pipelines with self-bootstrapping sandboxes and high-throughput inference. Google built Gemini 3.1 Pro specifically for deep reasoning on genuinely novel problems — the ARC-AGI2 benchmark explicitly tests logic problems the model has never seen, making retrieval from training data useless. Jones connects this design choice directly to DeepMind CEO Demis Hassabis’s 15-year mission statement: “solve intelligence first, then use it to solve everything else.”

The analysis then becomes a practical framework for how individuals and organizations should actually choose between frontier models. Jones argues that “hard” is not a single thing — problems can be hard because they require deep reasoning (Gemini 3.1 Pro’s strength), because they are large in scale but cognitively simple (effort-bottlenecked), or because they involve ambiguity and judgment where human context is irreplaceable. Matching model capability to problem type, rather than chasing the highest overall benchmark rank, is the more useful decision framework for anyone deploying AI in real workflows.


📺 Source: Nate B Jones · Published February 23, 2026
🏷️ Format: News Analysis

1 Item

Companies