Lessons from Trillion Token Deployments at Fortune 500s — Alessandro Cappelli, Adaptive ML

Lessons from Trillion Token Deployments at Fortune 500s — Alessandro Cappelli, Adaptive ML

More

Descriptions:

Alessandro Cappelli, co-founder and Chief Customer Officer of Adaptive ML, delivers a conference talk at AI Engineer drawing on production deployments across Fortune 500 clients including AT&T, Manulife, and CCS. The central thesis: reinforcement learning — not prompt engineering or supervised fine-tuning — is the mechanism that separates AI pilots from systems that actually reach and stay in production. Cappelli cites that 95% of generative AI pilots fail to make it to production, attributing this to what he calls the “myth of the last mile” — the false assumption that the hard part is building an impressive demo rather than surviving the continuous improvement cycle that follows.

Adaptive ML’s RL ops platform enables enterprises to retrain and refine specialized language models using real business feedback, KPIs, and LLM-as-judge reward signals in a closed loop. Cappelli explains that RL-trained smaller models can match frontier model performance at a fraction of the inference cost — a concrete example being AT&T, which spends millions annually on call transcript summarization and could dramatically reduce that figure by replacing a large hosted model with a compact RL-optimized one.

The talk extends into agent training, explaining how Adaptive ML plugs RL-trained models directly into existing agentic workflows — as done with Manulife’s established agent infrastructure — using business outcomes as reward signals. When labeled data is scarce, Cappelli describes using the RL environment itself as a synthetic data pipeline via rejection sampling, bootstrapping the first training run. A companion workshop by Adaptive ML colleagues Letizia and Joao, available on their engineering site, demonstrates the full pipeline hands-on.


📺 Source: AI Engineer · Published May 12, 2026
🏷️ Format: Deep Dive

1 Item

Channels