Scaling the Next Paradigm of Heterogeneous Intelligence — Adrian Bertagnoli, Callosum

Foundation Models2 months ago

Scaling the Next Paradigm of Heterogeneous Intelligence — Adrian Bertagnoli, Callosum

Descriptions:

Adrian Bertagnoli, founding engineer at Colossyan, delivers a conference talk arguing that the next major paradigm shift in AI will come from heterogeneous intelligence — systems combining different model architectures, sizes, and hardware types rather than scaling a single model on uniform compute. He grounds the argument in a mathematical formalization called the principle of maximum heterogeneity, drawing on analogues from neuroscience, economics, and ecology to demonstrate that heterogeneous systems outperform homogeneous ones under any reasonable constraints. The talk frames current trends — mixture-of-experts replacing dense models, multi-agent systems replacing single LLM calls, prefill-decode disaggregation at the hardware layer — as early signals of a broader architectural transition already underway.

The first concrete primitive Bertagnoli introduces is heterogeneous recursion, an extension of MIT’s recursive language model paper, which showed that context complexity (not just context length) causes performance degradation at around 60–30% context window occupancy. Colossyan’s extension maps recursive sub-tasks to different models and hardware based on computational demand, achieving results comparable to GPT-5 and GPT-5.2 on the Ulong benchmark while running significantly faster and at lower cost — with GPT-5 clocking roughly 2,000 seconds on the same tasks.

The second primitive is multimodal video action language models (VALMs), which integrate visual, language, and action capabilities for agents operating in video-rich environments. Bertagnoli frames the long-term trajectory as a co-evolution of AI software and hardware converging on full vertical integration — specialized silicon matched to specialized model types — as the dominant architecture for solving complex, multi-step real-world problems that decompose into fundamentally different sub-tasks.

📺 Source: AI Engineer · Published May 24, 2026
🏷️ Format: Deep Dive

1 Item

Channels

No Image Available

AI Engineer

Tags

Cerebras ChatGPT GPT-5.2 Kimi K2.5 MIT SambaNova

Prev

SpaceX’ $75B+ Historic IPO, GPT5.5 Outperforms Polymarket, AI Solves 80yr old math problem | EP #257

Next

Master LTX Director: The Ultimate Timeline Control for AI Video| Multi-Frame Reference

18 Related Posts

Related Posts

21:09

Foundation Models

Persona Engineering: A Field Guide to AI Synthetic Personas — Ishan Anand, InsightSciences.ai

23 hours ago

21:39

Foundation Models

Serving 2 Million Models Without Melting: Scaling the Hugging Face Hub — Arek Borucki, Hugging Face

2 days ago

06:40

Foundation Models

AMD Releases First Ever AI model: Instella-MoE-16B-A3B-Think

2 days ago

24:01

Foundation Models

US AI Dominance Is Over: Here’s Why

3 days ago

17:31

Foundation Models

The Messy Reality of Scale: Synthetic Data and Pre-Training — Marah Abdin & Robert McHardy, poolside

4 days ago

17:57

Foundation Models

Loop Engineering from First Principles — Kyle Mistele, HumanLayer

5 days ago