Description:
Two Minute Papers host Dr. Károly Zsolnai-Fehér argues that DeepSeek’s expanded 80-page technical report may represent the first complete, openly reproducible recipe for building ChatGPT-level AI, in direct contrast to OpenAI’s GPT-4 paper, which explicitly omitted architecture, hardware, compute, and training details. The video walks through five key technical findings from the DeepSeek R1 research that Zsolnai-Fehér considers genuine breakthroughs.
The first contribution is GRPO (Group Relative Policy Optimization), which replaces the expensive separate critic (“teacher”) model that standard PPO training requires: the model generates 16 candidate answers per question and each answer is scored relative to the rest of its own group, dramatically reducing training cost (a minimal sketch follows below). The second is DeepSeek R1’s discovery of chain-of-thought reasoning through pure reinforcement learning: starting with zero human examples, the model climbed from roughly 15% to nearly 80% accuracy on competition mathematics by itself. The third is emergent self-correction: during that training, the model spontaneously developed behaviors such as pausing to reconsider its answers. The fourth finding examines why a small number of initial examples (a “flashlight”) accelerates learning in natural-language tasks, more than tripling AlpacaEval performance, while adding little to abstract math. The fifth and most impactful contribution is distillation: DeepSeek used R1 to generate 800,000 reasoning examples, allowing much smaller models to inherit its capabilities.
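To make the group-relative idea concrete, here is a minimal Python sketch of how a GRPO-style scheme can score a group of sampled answers against each other. The group size of 16 comes from the video; the binary reward and the exact normalization are common formulations and should be read as illustrative assumptions rather than the report’s precise recipe.

```python
import numpy as np

def group_relative_advantages(rewards):
    """Score each sampled answer relative to its own group:
    advantage_i = (r_i - mean(r)) / std(r). No separate critic
    (value) model is needed, which is what cuts the training cost."""
    r = np.asarray(rewards, dtype=float)
    return (r - r.mean()) / (r.std() + 1e-8)  # epsilon guards a zero-spread group

# Illustrative run: 16 answers to one math problem, rewarded 1.0 when
# the final answer is correct and 0.0 otherwise (a rule-based reward).
rewards = [1, 0, 0, 1, 0, 0, 0, 1, 0, 0, 0, 0, 1, 0, 0, 0]
print(group_relative_advantages(rewards))
# Correct answers receive positive advantages and incorrect ones negative,
# so the policy update pushes probability mass toward the better answers.
```

The design point is that the group mean stands in for the learned value baseline that PPO would otherwise need, so the extra critic network (and its training) disappears entirely.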
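The pure reinforcement-learning result relies on a reward the model cannot game, and for verifiable domains like competition math a simple rule-based check suffices, with no human labels and no learned reward model. Below is a sketch of such a check; the assumption that final answers appear in a \boxed{...} marker is a common convention, not a detail confirmed by the video.

```python
import re

def accuracy_reward(model_output: str, reference_answer: str) -> float:
    """Return 1.0 if the model's final boxed answer matches the reference,
    else 0.0. Purely rule-based: no human labels, no learned reward model."""
    match = re.search(r"\\boxed\{([^}]*)\}", model_output)
    if match is None:
        return 0.0  # no parseable final answer earns no reward
    return 1.0 if match.group(1).strip() == reference_answer.strip() else 0.0

print(accuracy_reward(r"Adding the roots gives \boxed{42}", "42"))  # 1.0
print(accuracy_reward(r"I believe it is \boxed{41}", "42"))         # 0.0
```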
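Mechanically, the distillation step is ordinary supervised fine-tuning: the small model imitates R1’s generated reasoning traces under a next-token cross-entropy loss. The sketch below uses the Hugging Face Transformers API; the model identifier, the toy trace, and the loader are placeholders, and details such as batching, learning rate, and prompt formatting are assumptions, not the report’s settings.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

def load_r1_traces():
    # Stand-in for the ~800,000 (question, reasoning-trace) pairs
    # that R1 generated in the actual recipe.
    yield ("What is 2 + 2?", " Let's verify: 2 + 2 = 4. The answer is 4.")

# "small-base-model" is a placeholder identifier, not a real checkpoint.
tok = AutoTokenizer.from_pretrained("small-base-model")
model = AutoModelForCausalLM.from_pretrained("small-base-model")
opt = torch.optim.AdamW(model.parameters(), lr=1e-5)

model.train()
for question, trace in load_r1_traces():
    ids = tok(question + trace, return_tensors="pt").input_ids
    # Ordinary next-token prediction on the teacher's full trace:
    # the small model inherits the reasoning style by imitation.
    loss = model(input_ids=ids, labels=ids).loss
    loss.backward()
    opt.step()
    opt.zero_grad()
```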
The video positions this release as a landmark for open-source AI development and scientific reproducibility.
📺 Source: Two Minute Papers · Published February 04, 2026
🏷️ Format: Deep Dive