State of AI in 2026: LLMs, Coding, Scaling Laws, China, Agents


Description:

Lex Fridman convenes Sebastian Raschka — author of “Build a Large Language Model from Scratch” and “Build a Reasoning Model from Scratch” — and Nathan Lambert, post-training lead at the Allen Institute for AI and author of the definitive text on Reinforcement Learning from Human Feedback, for a comprehensive review of where artificial intelligence stands in early 2026.

The conversation uses January 2025’s DeepSeek R1 release as its dividing line: the moment an open-weight Chinese model achieved near-frontier reasoning performance at dramatically lower cost, reshaping competitive assumptions industry-wide. From there, Raschka and Lambert examine who is winning the global AI race, contrasting Anthropic’s Claude Opus 4.5 (which has drawn extraordinary organic community excitement), Google’s Gemini 3, and OpenAI with the accelerating output from Chinese developers. They then dig into the technical substance behind these results: how Reinforcement Learning from Verifiable Rewards (RLVR) works mechanistically, and what it actually unlocks versus what is already baked into pretraining weights, illustrated with a striking concrete example of the Qwen 3 base model jumping from 15% to 50% accuracy on the MATH-500 benchmark in just 50 training steps. This leads to a sharp debate about data contamination in Qwen evaluations and what such rapid gains actually prove.
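To make the RLVR idea concrete: the core loop is to sample outputs from the model, score each with a programmatic verifier (e.g. an exact-match check against a known answer), and upweight whatever the verifier accepts. The sketch below is a deliberately toy illustration of that loop, not code from DeepSeek, Qwen, or any real training stack; the candidate answers, learning rate, and REINFORCE-style reweighting are all illustrative assumptions.

```python
import random

def verify(answer: int, ground_truth: int) -> float:
    """Verifiable reward: binary exact-match check, no learned reward model."""
    return 1.0 if answer == ground_truth else 0.0

def rlvr_step(weights: dict[int, float], ground_truth: int, lr: float = 0.5) -> dict[int, float]:
    """One toy RLVR step: sample an answer, score it with the verifier,
    and multiplicatively upweight it if (and only if) the verifier accepts."""
    total = sum(weights.values())
    probs = {a: w / total for a, w in weights.items()}
    sampled = random.choices(list(probs), weights=list(probs.values()))[0]
    reward = verify(sampled, ground_truth)
    new = dict(weights)
    new[sampled] *= (1 + lr * reward)  # incorrect samples get reward 0: no change
    return new

random.seed(0)
# A stand-in "policy": probability mass over three candidate answers to "6 * 7".
weights = {40: 1.0, 41: 1.0, 42: 1.0}
for _ in range(50):  # mirrors the 50-step framing from the episode
    weights = rlvr_step(weights, ground_truth=42)
total = sum(weights.values())
print(round(weights[42] / total, 2))  # mass shifts toward the verified answer
```

The point of the toy is the one the guests debate: RLVR cannot conjure an answer the policy never samples; it can only concentrate probability on behavior already present, which is why rapid benchmark jumps invite questions about what pretraining had already baked in.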

Scaling laws, the economics of reasoning models with long output contexts, and emerging agent capabilities round out an episode that stands as one of the most technically rigorous and accessible State-of-AI reviews available for 2026.


📺 Source: Lex Fridman
🏷️ Format: Deep Dive
