Your Coding Agent Should Do AI System Engineering — Ben Burtenshaw, Hugging Face

Ben Burtenshaw, an engineer at Hugging Face, makes the case that coding agents have crossed a capability threshold: they can now tackle AI systems engineering problems, not just application-layer code. His talk at AI Engineer 2026 is structured around three progressively more autonomous challenges, each framed as a boss fight: writing optimized CUDA kernels, autonomously fine-tuning LLMs, and running a multi-agent automated research lab.

On CUDA kernels, Burtenshaw demonstrates Hugging Face’s `kernels` library, which lets agents generate hardware-specific inference optimizations and benchmark them against a compatibility matrix. An agentic workflow targeting Qwen 3 8B on an H100 produced a 94% inference speedup—not state-of-the-art, but representative of low-hanging fruit available when models aren’t optimized for specific GPU generations. He also introduces `upskill`, an open-source evaluation tool that benchmarks multiple models (GPT, Kimi, Haiku) on the same structured task, enabling cost-aware model substitution without accuracy loss.
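
To make "cost-aware model substitution without accuracy loss" concrete, here is a minimal sketch of the selection step only. It does not use or reproduce the `upskill` API; the `ModelResult` type, the `cheapest_within_tolerance` helper, the model names, and all scores and costs are hypothetical placeholders chosen for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelResult:
    name: str            # hypothetical model label
    accuracy: float      # fraction of task cases passed, between 0 and 1
    cost_per_run: float  # made-up cost units for one evaluation run


def cheapest_within_tolerance(results, tolerance=0.0):
    """Return the cheapest model whose accuracy is within `tolerance` of the best."""
    if not results:
        raise ValueError("no results to compare")
    best = max(r.accuracy for r in results)
    eligible = [r for r in results if r.accuracy >= best - tolerance]
    return min(eligible, key=lambda r: r.cost_per_run)


# Illustrative numbers only; these are not measurements from the talk.
results = [
    ModelResult("model-a", accuracy=0.92, cost_per_run=10.0),
    ModelResult("model-b", accuracy=0.92, cost_per_run=3.0),
    ModelResult("model-c", accuracy=0.80, cost_per_run=1.0),
]

choice = cheapest_within_tolerance(results)
print(choice.name)  # model-b: same accuracy as model-a at lower cost
```

With `tolerance=0.0` the helper only swaps in a cheaper model when it matches the best score exactly. Raising the tolerance trades some accuracy for cost, which is a separate decision from the "no accuracy loss" case described above.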

The second section shows how a single natural-language prompt can trigger a full LLM fine-tuning run on Hugging Face Hub infrastructure, using either standard HF CLI skills or the Unsloth framework for cheaper compute. The talk closes with AutoLab, a multi-agent research system inspired by Andrej Karpathy’s Auto Research project, which automates hypothesis generation, experimentation, and evaluation in a continuous loop. Burtenshaw argues the prerequisite for all of this is standardized repositories on the Hub, giving agents reliable surfaces to act on.
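
The hypothesis, experiment, and evaluation loop described for AutoLab can be sketched in a few lines. This is a toy illustration under stated assumptions, not AutoLab's implementation: `propose`, `run_experiment`, and `research_loop` are hypothetical names, a single process stands in for multiple agents, and the "experiment" is a made-up quadratic loss rather than a real training run.

```python
import random


def propose(best_lr, rng):
    """Hypothesis step: suggest a new learning rate near the current best."""
    return best_lr * rng.choice([0.5, 0.8, 1.25, 2.0])


def run_experiment(lr):
    """Experiment step: toy stand-in for a training run (loss is lowest at lr=0.01)."""
    return (lr - 0.01) ** 2


def research_loop(initial_lr, rounds, seed=0):
    """Repeat propose -> run -> evaluate, keeping only candidates that improve."""
    rng = random.Random(seed)
    best_lr, best_loss = initial_lr, run_experiment(initial_lr)
    history = [(best_lr, best_loss)]
    for _ in range(rounds):
        candidate = propose(best_lr, rng)
        loss = run_experiment(candidate)
        history.append((candidate, loss))
        if loss < best_loss:  # evaluation step: accept only improvements
            best_lr, best_loss = candidate, loss
    return best_lr, best_loss, history


best_lr, best_loss, history = research_loop(initial_lr=0.1, rounds=20)
print(f"best lr={best_lr:.4f}, loss={best_loss:.6f}, trials={len(history)}")
```

In a real system each step would presumably be far heavier (an agent writing a hypothesis, a job launching training, an evaluator scoring the result), but the control flow of keeping a running best and a full history of trials is the part this sketch is meant to show.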

📺 Source: AI Engineer · Published May 21, 2026
🏷️ Format: Hands On Build

Tags

AMD, Andrej Karpathy, auto research, Claude, CUDA, H100, Hugging Face, OpenCode
