Chip design from the bottom up – Reiner Pope

Foundation Models2 months ago

Chip design from the bottom up – Reiner Pope

Descriptions:

Dwarkesh Patel sits down with Reiner Pope, CEO of AI chip startup MatX, for a rare bottom-up walkthrough of how AI chips are actually designed — starting from the most fundamental logic gates and building systematically to the systolic arrays that power modern matrix multiplication.

Pope begins with AND gates and full adders, then shows how a multiply-accumulate unit is constructed from partial products, explaining why AI chips use lower-precision arithmetic (4-bit) for multiplication but higher-precision (8-bit) accumulation: repeated summation compounds rounding errors in a way single multiplications do not. He then scales up to systolic arrays, unpacking how they solve the memory bandwidth problem by storing weight matrices locally and trickling data in slowly — keeping wiring across the array boundary proportional to array width rather than width-times-height. The insight that runs through every level of the discussion is the same: maximize compute relative to communication, whether at the gate level, the chip level, or across multi-chip inference clusters.

The conversation draws direct connections to real-world chip architectures — why Cerebras uses wafer-scale SRAM, why Nvidia GPUs face an off-chip HBM bottleneck, and why disaggregated prefill/decode inference (which Pope links to Nvidia’s Groq acquisition) has now become commercially viable. For engineers, researchers, or technical leaders who want to understand why AI hardware is designed the way it is — and how those design decisions propagate into serving costs and latency — this is an unusually rigorous and accessible primer from someone actively building in the space.

📺 Source: Dwarkesh Patel · Published May 22, 2026
🏷️ Format: Deep Dive

1 Item

Channels

No Image Available

Dwarkesh Patel

1 Item

Companies

No Image Available

Nvidia

Tags

Dwarkesh Patel GPU Groq HBM Nvidia SemiAnalysis TPU

Prev

This is absolutely CRAZY

Next

printf is Actually a Secret Virtual Machine – And a Giant Security Hole!

18 Related Posts

Related Posts

21:09

Foundation Models

Persona Engineering: A Field Guide to AI Synthetic Personas — Ishan Anand, InsightSciences.ai

1 day ago

21:39

Foundation Models

Serving 2 Million Models Without Melting: Scaling the Hugging Face Hub — Arek Borucki, Hugging Face

2 days ago

06:40

Foundation Models

AMD Releases First Ever AI model: Instella-MoE-16B-A3B-Think

2 days ago

24:01

Foundation Models

US AI Dominance Is Over: Here’s Why

3 days ago

17:31

Foundation Models

The Messy Reality of Scale: Synthetic Data and Pre-Training — Marah Abdin & Robert McHardy, poolside

4 days ago

20:24

Foundation Models

From Agent Traces to Agent Simulations — Rustem Feyzkhanov, Snorkel AI

5 days ago