OmniCoder-9B Running Locally: I Tried to Break It With Real Engineering Tasks


Description:

Fahd Mirza puts OmniCoder-9B, a coding-focused model from Tesslr fine-tuned on the Qwen3.5 9B hybrid architecture, through a hands-on local evaluation on an Nvidia RTX 6000 with 48 GB of VRAM. The model claims a GPQA Diamond score of 83.8% and was trained on 425,000 curated agentic trajectories sourced from frontier-model outputs, giving it habits such as reading files before writing, responding to compiler diagnostics, and making minimal diffs rather than rewriting entire files.

Mirza serves the model using vLLM, configures recommended hyperparameters (temperature 0.6, top-K 20, top-P 0.95) through Open WebUI, and then prompts it with a challenging task: generate a self-contained HTML file simulating a Kerbal Space Program-style rocket booster simulator. The resulting output includes physics simulation, canvas rendering, and game-state management — and actually runs in the browser. The model also supports a 262K-token context window and chain-of-thought reasoning via think tags, and is fully open-weight under Apache 2.0.
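As a rough sketch of the setup described above, the snippet below builds a chat request with the recommended sampling hyperparameters (temperature 0.6, top-k 20, top-p 0.95) for vLLM's OpenAI-compatible server. The model identifier, port, and endpoint path are assumptions for illustration; the video does not state them exactly.

```python
import json

# Hypothetical model identifier; the exact repo name is not given in the video.
MODEL = "Tesslr/OmniCoder-9B"

# Assumed local endpoint, e.g. after running: vllm serve <model> --port 8000
ENDPOINT = "http://localhost:8000/v1/chat/completions"

def build_request(prompt: str) -> dict:
    """Build an OpenAI-compatible chat payload with the recommended
    sampling hyperparameters from the video."""
    return {
        "model": MODEL,
        "messages": [{"role": "user", "content": prompt}],
        "temperature": 0.6,
        "top_p": 0.95,
        # top_k is a vLLM sampling extension, not part of the base OpenAI schema.
        "top_k": 20,
    }

payload = build_request(
    "Generate a self-contained HTML file simulating a rocket booster."
)
print(json.dumps(payload, indent=2))
```

Open WebUI exposes the same three knobs in its model settings, so the values above mirror what Mirza configures through the UI.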

The video is most valuable for its behavioral analysis angle: rather than just quoting leaderboard numbers, Mirza highlights the agentic coding habits baked into OmniCoder-9B through its training data, and explains why those behaviors matter more than raw benchmark scores for real automated coding pipelines. Developers evaluating small open-weight coding models for local agent setups will find this a useful practical reference.


📺 Source: Fahd Mirza · Published March 14, 2026
🏷️ Format: Hands-On Build
