Radically Better Reasoning: Elicit’s Andreas Stuhlmüller & Jungwon Byun on World Models for Research

Interviews · 2 weeks ago

Description:

Andreas Stuhlmüller and Jungwon Byun, co-founders of Elicit, join the Cognitive Revolution podcast to discuss how their AI platform for scientific research is evolving from process supervision toward a new concept they call external world models. Elicit — which now works with seven of the top 20 life sciences companies on tasks ranging from drug target ranking to regulatory defense of launch pricing — originally bet that rewarding models for step-by-step reasoning quality would produce more reliable outputs than training on final answers alone. The challenge: powerful frontier reasoning models increasingly hide their chain of thought.

Their solution is a domain-specific language (DSL) that defines reasoning primitives as discrete microservices, allowing frontier models to dynamically compose structured workflows that are guaranteed to execute as defined. The founders explain why LLMs are still too easy to manipulate for high-stakes decision support, and introduce the concept of “certificates of reasoning”: verifiable proof that the prescribed reasoning steps were actually carried out. They also discuss their internal automation system, “the line,” which now delivers 30–50 code changes per week, as well as the company’s token spending and where Gemini fits in their stack.
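To make the idea of composing primitives and certifying their execution more concrete, here is a minimal Python sketch. It is not Elicit's DSL or its certificate format, neither of which is specified in this description. The primitive names (`dedupe`, `filter_short`), the `run_workflow` and `verify` functions, and the hash-chained log are all hypothetical, chosen only to illustrate a plan of named steps being executed in a fixed order and checked afterward. A hash chain like this shows only that a log is internally consistent with the plan; it does not prove who ran the steps.

```python
import hashlib
import json
from typing import Callable

# Hypothetical registry of reasoning primitives (illustrative names only).
PRIMITIVES: dict[str, Callable[[list[str]], list[str]]] = {
    "dedupe": lambda items: sorted(set(items)),
    "filter_short": lambda items: [s for s in items if len(s) >= 5],
}


def _digest(payload: dict) -> str:
    """Stable SHA-256 digest of a JSON-serializable record."""
    encoded = json.dumps(payload, sort_keys=True).encode()
    return hashlib.sha256(encoded).hexdigest()


def run_workflow(plan: list[str], data: list[str]) -> tuple[list[str], list[dict]]:
    """Run each named primitive in order, logging a hash-chained record per step."""
    certificate: list[dict] = []
    prev = ""
    for name in plan:
        if name not in PRIMITIVES:
            raise KeyError(f"unknown primitive: {name}")
        output = PRIMITIVES[name](data)
        body = {"step": name, "input": data, "output": output, "prev": prev}
        prev = _digest(body)
        certificate.append({**body, "hash": prev})
        data = output
    return data, certificate


def verify(plan: list[str], certificate: list[dict]) -> bool:
    """Check the log against the plan by re-hashing and re-running every step."""
    if [record["step"] for record in certificate] != plan:
        return False
    prev = ""
    for record in certificate:
        body = {key: record[key] for key in ("step", "input", "output", "prev")}
        if record["prev"] != prev or _digest(body) != record["hash"]:
            return False
        if PRIMITIVES[record["step"]](record["input"]) != record["output"]:
            return False
        prev = record["hash"]
    return True


plan = ["dedupe", "filter_short"]
result, certificate = run_workflow(plan, ["aspirin", "aspirin", "zinc", "ibuprofen"])
print(result, verify(plan, certificate))
```

In this toy version the plan is a plain list of step names. A model composing the workflow would choose which names go in the list, while the executor, not the model, decides what each name does.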

The conversation ranges into epistemological territory — how to reduce hard-to-verify tasks to easy-to-verify ones, and why external structured representations of evidence may ultimately be more reliable than in-weights learning. Essential listening for anyone building AI into regulated, high-stakes workflows.

📺 Source: Cognitive Revolution “How AI Changes Everything” · Published June 17, 2026
🏷️ Format: Podcast

Tags

Anthropic, ChatGPT, Claude, Claude Mythos, Claude Opus 4.5, Gemini, Gemini 3 Pro, Slack
