Code World Model: Building World Models for Computation – Jacob Kahn, FAIR Meta

Foundation Models7 months ago

Code World Model: Building World Models for Computation – Jacob Kahn, FAIR Meta

Descriptions:

Jacob Kahn, a researcher at Meta FAIR, presents the Code World Model (CWM)—a research project aimed at teaching AI systems to reason about and plan through code by explicitly modeling program execution rather than treating code as pure syntax.

The core insight of CWM is that standard autoregressive language models see only tokens, but a world model can predict execution traces: step-by-step state transitions showing local variables, memory, and control flow as a program runs. By framing code execution as a transition function (state + action → next state), CWM creates a structured chain-of-thought that models can both learn from and generate, enabling them to simulate execution without running code in a real environment. This simulation capability is designed to make agentic code workflows significantly more efficient.

Kahn also walks through CWM’s training pipeline—pre-training on trillions of tokens, domain-specific mid-training, long-context extensions, instruction fine-tuning, and an asynchronous RL loop with samplers, trajectory scorers, and a bash-heavy tool environment. A deliberate design choice to minimize available tools forces the model to master terminal commands, closely mirroring how a real software engineer operates. The talk is particularly valuable for researchers and practitioners interested in how execution-aware training, world models, and RL post-training intersect to produce stronger coding agents.

📺 Source: AI Engineer · Published December 17, 2025
🏷️ Format: Deep Dive

1 Item

Channels

No Image Available

AI Engineer

Tags

GitHub Hugging Face

Prev

Shipmas Day 12: AI Music Video Generator App

Shipmas Day 12: AI Music Video Generator App

Next

AI Consulting in Practice – NLW, Superintelligent, @AIDailyBrief⁩

AI Consulting in Practice – NLW, Superintelligent, @AIDailyBrief⁩

18 Related Posts

Related Posts

25:21

Foundation Models

Deepseek drops another HUGE breakthrough

24 hours ago

09:01

Foundation Models

NVIDIA’s Two-Tower Model Generates Text 2.4x Faster Without Losing Quality

2 days ago

07:27

Foundation Models

This New AI Model Changes Everything

3 days ago

14:10

Foundation Models

Your Agent Failed in Prod. Good Luck Reproducing It. – Tisha Chawla & Susheem Koul, Microsoft

5 days ago

30:38

Foundation Models

The Future Is Domain-Specific Agents – Justin Schroeder, StandardAgents

5 days ago

07:14

Foundation Models

Deterministic Infra for Non-Deterministic AI Agents – Nishant Gupta, Meta Superintelligence Labs

5 days ago