Stop Making Models Bigger, Make Them Behave — Kobie Crawdord, Snorkel

Foundation Models2 months ago

Stop Making Models Bigger, Make Them Behave — Kobie Crawdord, Snorkel

Descriptions:

Kobie Crawford, developer advocate at Snorkel AI, presented research conducted in partnership with UC Berkeley’s RLLM lab showing that a 4-billion-parameter model fine-tuned on a high-quality, expert-verified dataset can outperform a 235-billion-parameter frontier model on a structured tool-use task for financial analysis.

The research targets a failure mode Crawford argues is underappreciated in large models: despite superior general reasoning, frontier models often lack the disciplined, consistent tool-calling behavior that enterprise production systems require. In a live demonstration, a 235B model repeatedly queried nonexistent database tables when given a financial analysis task, eventually hallucinating an answer after two failed tool calls. The fine-tuned 4B model, trained using reinforcement learning on the same task type, selected the correct tools and returned a verifiable answer. The key difference was not reasoning capability but behavioral consistency under constrained, schema-aware tool use.

Snorkel’s data pipeline relies on domain experts — PhD-level researchers and senior industry practitioners — who generate and verify training tasks to ensure answer verifiability and task-to-domain fit before RL training begins. Crawford’s broader argument is that for well-scoped enterprise tasks, targeted data quality frequently outperforms raw model scale, with significant downstream benefits for inference cost, latency, and deployment security. The talk is directly relevant to ML engineers and enterprise architects weighing model size, fine-tuning investment, and reliability requirements for production AI systems.

📺 Source: AI Engineer · Published June 10, 2026
🏷️ Format: Deep Dive

1 Item

Channels

No Image Available

AI Engineer

Tags

Hugging Face Prime Intellect Snorkel AI

Prev

How To Use Claude Fable 5 – Tips And Tricks Most People Miss

Next

Google’s Agents CLI: The CLI + Skills Combination to Ship AI Agents EASILY

18 Related Posts

Related Posts

21:09

Foundation Models

Persona Engineering: A Field Guide to AI Synthetic Personas — Ishan Anand, InsightSciences.ai

1 day ago

21:39

Foundation Models

Serving 2 Million Models Without Melting: Scaling the Hugging Face Hub — Arek Borucki, Hugging Face

2 days ago

06:40

Foundation Models

AMD Releases First Ever AI model: Instella-MoE-16B-A3B-Think

2 days ago

24:01

Foundation Models

US AI Dominance Is Over: Here’s Why

3 days ago

17:31

Foundation Models

The Messy Reality of Scale: Synthetic Data and Pre-Training — Marah Abdin & Robert McHardy, poolside

4 days ago

23:13

Foundation Models

Evaling Video Slop — Maor Bril, Character.ai

5 days ago