The thinking lever


Description:

Anthropic product manager Matt Bleifer delivers a detailed technical explanation of how Claude uses test-time compute — also called inference-time compute — to tackle complex problems, and which levers developers can pull to control this behavior. The talk establishes that scaling compute at inference time follows patterns similar to training-time scaling: more time and tokens spent on a problem consistently yield better results across agentic coding, computer use, and PhD-level reasoning tasks.

A traffic simulation demo concretely illustrates the stakes. Running Opus 4.7 on low effort produces a functional but basic result in roughly 50 seconds using around 4,600 output tokens. Cranking to high effort doubles both time and tokens and produces a meaningfully better simulation with an intelligent driver model. Maxing out effort consumes 10x the tokens and time of the low setting, yielding the best graphics, physics, and traffic behavior. Bleifer breaks down three distinct token types — thinking tokens (chain-of-thought scratch pad), tool call tokens, and response text — and traces the evolution from single-block pre-response thinking, to interleaved thinking between tool calls, to the current adaptive thinking paradigm.
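The three token types map onto distinct content-block types in a Messages API response. A minimal sketch of tallying them — the sample response content below is illustrative, not real model output:

```python
# Tally the three token-bearing block types described in the talk:
# thinking (chain-of-thought scratchpad), tool calls, and response text.
from collections import Counter

def count_block_types(content_blocks):
    """Count response content blocks by their "type" field."""
    return Counter(block["type"] for block in content_blocks)

# Hypothetical response content, shaped like the Messages API's
# message.content list when extended thinking and tools are enabled.
sample_content = [
    {"type": "thinking", "thinking": "Plan: inspect the file first..."},
    {"type": "tool_use", "name": "read_file", "input": {"path": "sim.py"}},
    {"type": "thinking", "thinking": "The driver model needs a gap check..."},
    {"type": "text", "text": "I updated the simulation's driver model."},
]

print(count_block_types(sample_content))
# Counter({'thinking': 2, 'tool_use': 1, 'text': 1})
```

Under interleaved or adaptive thinking, thinking blocks can appear anywhere in this list, not just at the start.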

Adaptive thinking, the default benchmark setting since Opus 4.6, gives Claude the freedom to think at any point during a response — before or after tool calls, in the middle of text generation — without any fixed constraints on timing or volume. The presentation covers the effort API parameter, budget tokens for explicit thinking limits, and practical guidance on matching effort levels to task complexity.
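The two levers can be sketched as Messages API request payloads. The `thinking.budget_tokens` shape matches Anthropic's documented extended-thinking parameter; the top-level `effort` field and the model id are assumptions based on the talk — check the current API reference for exact names:

```python
# Two levers for controlling Claude's test-time compute, expressed as
# request payload dicts (no network call is made here).

def effort_request(effort: str) -> dict:
    """Coarse lever: pick an effort level and let the model decide how
    much to think. The top-level "effort" field is an assumed shape."""
    return {
        "model": "claude-opus-4-7",  # hypothetical id, from the talk
        "max_tokens": 16000,
        "effort": effort,  # e.g. "low" | "medium" | "high"
        "messages": [{"role": "user", "content": "Build a traffic simulation."}],
    }

def budget_request(budget_tokens: int) -> dict:
    """Fine lever: an explicit ceiling on thinking tokens, via the
    documented extended-thinking parameter."""
    return {
        "model": "claude-opus-4-7",
        "max_tokens": 16000,
        "thinking": {"type": "enabled", "budget_tokens": budget_tokens},
        "messages": [{"role": "user", "content": "Build a traffic simulation."}],
    }

print(budget_request(8000)["thinking"])
# {'type': 'enabled', 'budget_tokens': 8000}
```

Per the talk's guidance, the effort lever suits most cases (match it to task complexity), while an explicit token budget is useful when cost or latency must be bounded precisely.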


📺 Source: Claude · Published May 08, 2026
🏷️ Format: Deep Dive
