How agent o11y differs from traditional o11y — Phil Hetzel, Braintrust

Foundation Models2 months ago

How agent o11y differs from traditional o11y — Phil Hetzel, Braintrust

Descriptions:

Phil Hetzel, head of solutions engineering at Braintrust, delivers a conference talk breaking down exactly why agent observability is a fundamentally different engineering problem from the traditional observability most teams already have in place. The talk is framed around the limits of established tools like Grafana and Datadog, which were designed to answer one question: is the system up, and is it performing within technical SLAs? That scope, Hetzel argues, is categorically insufficient for AI agents.

The core distinctions he works through include non-determinism (agents produce highly variable outputs unlike deterministic application code paths), the structural complexity of agent traces (nested spans mixing model calls, tool calls, and large volumes of unstructured text), and a dual read-pattern challenge: the platform must simultaneously support real-time trace streaming for live monitoring and analytical SQL-style querying for evaluation pipelines. Hetzel explains that Braintrust built a purpose-built database from the ground up to handle these requirements — incorporating a write-ahead log for instant trace visibility, optimized indexing for filter queries, and a forked version of Tantivy, an open-source full-text search framework, to enable text-based trace queries such as retrieving every session that referenced a specific word or phrase.

The talk is grounded in Hetzel’s 12 years of consulting experience, including leading the global Databricks practice at Slalom Consulting, and is aimed at AI engineers and platform teams deciding how to instrument production agent systems and understand where traditional observability tooling falls short.

📺 Source: AI Engineer · Published May 28, 2026
🏷️ Format: Deep Dive

1 Item

Channels

No Image Available

AI Engineer

Tags

BrainTrust Databricks DataDog

Prev

Claude lead gen

Next

Anthropic just dropped Opus 4.8… (WOAH)

18 Related Posts

Related Posts

21:09

Foundation Models

Persona Engineering: A Field Guide to AI Synthetic Personas — Ishan Anand, InsightSciences.ai

1 day ago

21:39

Foundation Models

Serving 2 Million Models Without Melting: Scaling the Hugging Face Hub — Arek Borucki, Hugging Face

2 days ago

06:40

Foundation Models

AMD Releases First Ever AI model: Instella-MoE-16B-A3B-Think

2 days ago

24:01

Foundation Models

US AI Dominance Is Over: Here’s Why

3 days ago

17:31

Foundation Models

The Messy Reality of Scale: Synthetic Data and Pre-Training — Marah Abdin & Robert McHardy, poolside

4 days ago

20:24

Foundation Models

From Agent Traces to Agent Simulations — Rustem Feyzkhanov, Snorkel AI

5 days ago