AI Dev 26 x SF | Pratik Verma: Observability Agent to Find & Fix Issues in AI Agents

Tutorials2 months ago

AI Dev 26 x SF | Pratik Verma: Observability Agent to Find & Fix Issues in AI Agents

Descriptions:

Pratik Verma presents a practical framework for moving AI agents from impressive demos to production-ready systems at AI Dev 26 San Francisco. His core finding: agents rarely fail because of logical coding errors — they fail in real-world deployment due to ambiguous inputs, missing context, and edge cases that are difficult to anticipate during development, making conventional software testing approaches largely insufficient.

The centerpiece of the talk is Monle, an open-source observability framework designed specifically for agentic traces. Standard OpenTelemetry instrumentation struggles to capture failures inside framework-internal code — such as LangGraph or LangChain internals — where the most consequential agent behavior actually occurs. Monle solves this with structured telemetry that minimizes post-collection cleanup. The talk also announces a new Monle capability enabling billing instrumentation for outcome-based pricing, so the same library that helps find failures can facilitate payment when agents succeed.

Verma demonstrates the Okahoo IDE extension for VS Code and Cursor, which surfaces relevant traces directly in the editor and passes them as context to coding agents like GitHub Copilot or Claude CLI for targeted fixes. A second demo tackles quality failures — agents that respond politely but incorrectly, generating no exceptions and leaving no trace in standard logs — using LLM-as-judge evaluations applied at the trace level. The resulting testing framework supports assertions on tool call inputs and outputs, not just final responses, enabling a continuous deploy-observe-evaluate-fix reliability loop. The Okahoo extension is available free via QR code at the talk.

📺 Source: DeepLearningAI · Published May 20, 2026
🏷️ Format: Tutorial Demo

1 Item

Channels

No Image Available

DeepLearningAI

Tags

CodeRabbit Cursor GitHub GitHub Copilot LangChain MCP

Prev

Wizstar AI Video Generator – Full Marketing Video From Just an Amazon Link | Full Walkthrough

Next

This AI Model Has No VAE! Testing HiDream-O1’s Unified Transformer

18 Related Posts

Related Posts

08:04

Tutorials

Herdr: Run Multiple AI Coding Agents in Parallel from Your Terminal

2 hours ago

15:54

Tutorials

Buzz Huddle Test: 4 Humans, 2 AI Agents

2 hours ago

22:53

Tutorials

The Viral $1 Website Effect That Looks Like $10K (Tutorial)

1 day ago

20:17

Tutorials

Paste This Into Claude, Never Hit a Token Limit Again

1 day ago

15:54

Tutorials

AI Video 101: How to Master AI Videos (Beginner to Advanced)

1 day ago

08:12

Tutorials

How to Run Kimi K3 Locally (3 Ways)

1 day ago