Research to Reality: Bringing Frontier ML Research to Production – Vaidas Razgaitis, Higharc

Agents & Automation6 days ago

Research to Reality: Bringing Frontier ML Research to Production – Vaidas Razgaitis, Higharc

Descriptions:

Vaidas Razgaitis, senior research engineer at Higharc’s AI labs team, addresses a challenge familiar to any company running serious ML workloads: how do you move a research prototype — built by ML researchers comfortable with papers and experiments but not production-grade APIs — into a robust, maintainable production system built by software engineers who have never trained a model?

Higharc’s product spans computer vision for parsing hand-sketched floor plans, reasoning agents, custom transformer models, diffusion-based image generation, and more, making this handoff problem acute. Razgaitis organizes his solution around three levers. First, research legibility: every prototype must produce a “research prototype taxonomy document” — a TDD-style spec covering domain-specific data representations, business goals, type contracts, persistence layer design, system architecture, and a decomposition plan for production migration. Second, code structure: Higharc maintains a separate Python monorepo of cleanly isolated microservices, with roughly a one-to-one researcher-to-microservice ratio, each built as a layered FastAPI application (API routers → controllers → business logic → data layer) behind a central gateway. Third, production decomposition: the taxonomy document’s architecture section provides the roadmap for standing up each prototype, with agent-navigable documentation built in from the start.

The talk is practical and replicable, offering a transferable process for any team where ML researchers and platform engineers must collaborate without speaking each other’s language.

📺 Source: AI Engineer · Published June 28, 2026
🏷️ Format: Workflow Case Study

1 Item

Channels

No Image Available

AI Engineer

Tags

Docker FastAPI Modal Notion The Pragmatic Engineer

Prev

HERMES AGENT + Stripe Payments + NVIDIA Nemotron is INSANE!

Next

Run DeepSeek DSpark on Qwen3 Locally and Reproduce the Speedup

18 Related Posts

Related Posts

21:17

Agents & Automation

Claude Fable 5 Is Finally Back: 5 Must-Try Use Cases Before July 7

3 days ago

14:23

Agents & Automation

I Built An App (Here’s How I Make Sure It Stays Competitive)

4 days ago

14:41

Agents & Automation

Using RL Agent to Detect and Remediate ETL Pipeline Failures – Anna Marie Benzon

5 days ago

15:03

Agents & Automation

My First Winning Agentic AI Trading Strategy On Polymarket

2 weeks ago

20:04

Agents & Automation

After spent 30+ hrs building loops…

2 weeks ago

01:02:25

Agents & Automation

Matt Pocock’s Agentic Engineering Workflow (just copy him)

2 weeks ago