Description:
Fahd Mirza demonstrates Symbol, a code search library designed specifically for AI coding agents, integrated with the OpenCode terminal agent and a locally running Ollama model. The problem Symbol addresses is context-window waste: when agents use grep and read entire files to answer questions about a codebase, they dump thousands of irrelevant lines into the model’s context. Symbol replaces this with a hybrid retrieval system that indexes an entire repository in 250 milliseconds and returns precise code chunks in 1.5 milliseconds, all running on CPU with no API keys or GPU required.
Under the hood, Symbol combines a 16-million-parameter static embedding model—distilled from a 137-million-parameter transformer and available on Hugging Face—with BM25 keyword matching fused via reciprocal rank fusion (RRF). The result delivers 99% of the retrieval quality of full transformer models while using 98% fewer tokens than the grep-and-read approach, according to the project’s benchmarks.
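The fusion step described above can be illustrated with a minimal sketch of reciprocal rank fusion. This is not Symbol's actual code or API; the function name, the constant k = 60 (a common default in the RRF literature), and the example file names are all illustrative assumptions.

```python
def rrf_fuse(rankings, k=60):
    """Fuse multiple ranked result lists into one (reciprocal rank fusion).

    rankings: list of ranked lists of document ids, best match first.
    Each document scores sum(1 / (k + rank)) over every list it appears in,
    so items ranked highly by several retrievers rise to the top.
    NOTE: illustrative sketch only -- not Symbol's implementation.
    """
    scores = {}
    for ranked in rankings:
        for rank, doc_id in enumerate(ranked, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    return sorted(scores, key=scores.get, reverse=True)

# Hypothetical example: fuse a BM25 keyword ranking with an
# embedding-similarity ranking over the same repository.
bm25_hits = ["kernel.cu", "decode.cpp", "utils.h"]
dense_hits = ["decode.cpp", "mask.cpp", "kernel.cu"]
fused = rrf_fuse([bm25_hits, dense_hits])
```

Because RRF works purely on ranks, it needs no score normalization between the keyword and embedding retrievers, which is why it is a popular choice for this kind of hybrid search.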
The walkthrough uses LooseBox, a 2,000-line C++ CUDA codebase with custom speculative decoding kernels, as the test repository. Mirza configures Symbol as an MCP server in OpenCode’s config file, launches the agent from the repo root, and asks natural language questions about the codebase. The Ollama-powered Qwen 1.5 35B model responds with precise file locations, line numbers, and technical explanations—including details about DD tree verification, attention masks, and SSM rollback—without reading any complete files.
📺 Source: Fahd Mirza · Published May 01, 2026
🏷️ Format: Tutorial Demo
