AI Dev 26 x SF | Adit Abraham: Better Agents with Better Data

Description:

Adit Abraham, CEO of Reducto, presents at AI Dev 26 x SF on one of the most underappreciated bottlenecks in enterprise AI deployments: the quality of data fed to agents. Reducto, backed by Andreessen Horowitz and Benchmark with $108 million raised, has processed over 3 billion documents for clients including Fortune 10 companies, major hedge funds, and AI-native firms like Harvey, Scale AI, and Rogo.

Abraham walks through why PDFs remain stubbornly difficult to parse accurately despite decades of work on the problem. He frames the core tension as one between traditional computer-vision OCR (deterministic, bounding-box-preserving, fast) and frontier language models (context-aware, but prone to “reasoning” about content mid-extraction, such as computing a total instead of reading it off the page). Reducto’s answer is a technique it calls agentic OCR, which applies speculative decoding (a single forward pass that identifies token-level corrections) to improve accuracy while preserving the structural characteristics that downstream agents depend on.
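The draft-then-verify shape described above can be sketched loosely in Python. This is an illustration only, not the company's implementation: `Token`, `Verifier`, `toy_verifier`, and the `min_conf` threshold are all hypothetical names and values introduced here. The draft stands in for traditional OCR output, and the verifier stands in for a model that reviews the whole draft in one call and proposes token-level corrections.

```python
from dataclasses import dataclass, replace
from typing import Callable, List, Optional, Tuple

BBox = Tuple[int, int, int, int]  # (x0, y0, x1, y1) in page pixels


@dataclass(frozen=True)
class Token:
    text: str
    bbox: BBox
    corrected: bool = False


# Hypothetical interface: the verifier sees the whole draft once and returns,
# for each token, either None (accept as-is) or (replacement_text, confidence).
Verifier = Callable[[List[Token]], List[Optional[Tuple[str, float]]]]


def apply_corrections(
    draft: List[Token], verify: Verifier, min_conf: float = 0.9
) -> List[Token]:
    """Accept only confident token-level edits; never touch bounding boxes."""
    proposals = verify(draft)  # one call over the full draft
    if len(proposals) != len(draft):
        raise ValueError("verifier must return one proposal per draft token")
    out = []
    for tok, prop in zip(draft, proposals):
        if prop is not None and prop[1] >= min_conf and prop[0] != tok.text:
            # Swap the text only; the bbox from the OCR draft is preserved.
            out.append(replace(tok, text=prop[0], corrected=True))
        else:
            out.append(tok)
    return out


def toy_verifier(tokens: List[Token]) -> List[Optional[Tuple[str, float]]]:
    """Stand-in for a model: fixes a letter O misread inside numeric tokens."""
    proposals: List[Optional[Tuple[str, float]]] = []
    for tok in tokens:
        if "O" in tok.text and any(ch.isdigit() for ch in tok.text):
            proposals.append((tok.text.replace("O", "0"), 0.95))
        else:
            proposals.append(None)
    return proposals


draft = [Token("Total", (10, 10, 60, 24)), Token("1,2O0.50", (70, 10, 140, 24))]
corrected = apply_corrections(draft, toy_verifier)
```

Because the verifier can only propose replacements for tokens that already exist, it cannot add a computed total or reorder the layout, which is the property the paragraph above attributes to this approach.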

The talk also covers best practices for formatting extracted data: markdown works well for clean tabular data because LLMs reason on it efficiently, but complex layouts with merged cells or nested structures require different representations. Abraham argues that teams routinely stop at extraction and overlook how output format shapes agent performance. He closes with a forward-looking discussion of confidence scoring to triage documents for agent-in-the-loop versus human-in-the-loop review, and the path toward human-level performance on the hardest document understanding tasks.
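The two ideas in this paragraph, choosing an output format per table and routing documents by confidence, can be sketched as follows. Everything here is an assumption made for illustration: the summary does not say which representation replaces markdown for merged cells (HTML is used because it can express `rowspan` and `colspan`), and the `Cell` type, function names, and threshold values are hypothetical.

```python
from dataclasses import dataclass
from html import escape
from typing import List


@dataclass
class Cell:
    text: str
    rowspan: int = 1
    colspan: int = 1


Table = List[List[Cell]]  # the first row is treated as the header


def has_merged_cells(table: Table) -> bool:
    return any(c.rowspan > 1 or c.colspan > 1 for row in table for c in row)


def to_markdown(table: Table) -> str:
    if not table:
        return ""

    def line(row: List[Cell]) -> str:
        # Escape pipes so cell text cannot break the column structure.
        return "| " + " | ".join(c.text.replace("|", "\\|") for c in row) + " |"

    header, *body = table
    separator = "| " + " | ".join("---" for _ in header) + " |"
    return "\n".join([line(header), separator, *(line(r) for r in body)])


def to_html(table: Table) -> str:
    rows = []
    for i, row in enumerate(table):
        tag = "th" if i == 0 else "td"
        cells = "".join(
            f'<{tag} rowspan="{c.rowspan}" colspan="{c.colspan}">'
            f"{escape(c.text)}</{tag}>"
            for c in row
        )
        rows.append(f"<tr>{cells}</tr>")
    return "<table>" + "".join(rows) + "</table>"


def render_for_agent(table: Table) -> str:
    """Markdown for clean grids; HTML (assumed fallback) when cells are merged."""
    return to_html(table) if has_merged_cells(table) else to_markdown(table)


def triage(confidence: float, auto_accept: float = 0.95,
           agent_review: float = 0.70) -> str:
    """Route a document by extraction confidence (thresholds are placeholders)."""
    if confidence >= auto_accept:
        return "accept"
    if confidence >= agent_review:
        return "agent_review"
    return "human_review"


simple = [[Cell("Item"), Cell("Qty")], [Cell("Widget"), Cell("2")]]
merged = [[Cell("Region", colspan=2)], [Cell("North"), Cell("South")]]
simple_out = render_for_agent(simple)
merged_out = render_for_agent(merged)
```

In practice the thresholds would be tuned against labeled documents rather than fixed up front.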

📺 Source: DeepLearningAI · Published May 20, 2026
🏷️ Format: Deep Dive

Tags: a16z, Anthropic, Harvey, Scale AI
