Description:
Fahd Mirza walks through how to run the Qwen3.5 35B model entirely locally with llama.cpp and then wire it into OpenClaw, the open-source command-line agent framework — no API keys or cloud services required. The setup runs on an NVIDIA RTX 6000 GPU with 48 GB of VRAM, drawing around 36.45 GB during inference, which gives a realistic picture of the hardware a model at this scale demands.
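The exact launch command isn't reproduced above, but starting llama.cpp's bundled `llama-server` for a model of this size typically looks like the sketch below. The GGUF filename, context length, and quantization are assumptions; `-m`, `--port`, and `-ngl` are standard llama.cpp server flags.

```shell
# Sketch of launching llama.cpp's OpenAI-compatible server on port 8080.
# The model filename and context size here are assumptions — use your own.
MODEL="$HOME/models/Qwen3.5-35B-Q4_K_M.gguf"
PORT=8080

# -ngl 99 offloads all layers to the GPU; a ~36 GB footprint fits in 48 GB VRAM.
CMD="llama-server -m $MODEL --port $PORT -ngl 99 -c 16384"
echo "launch: $CMD"

# Uncomment once the model file is in place:
# $CMD
```

The `echo` line just previews the command so you can verify the flags before committing a multi-minute model load.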
The tutorial covers the full installation sequence: setting up Node.js and npm via nvm, installing OpenClaw and running its onboarding process, and then editing OpenClaw’s configuration file to point its provider at the locally running llama.cpp server on port 8080. The key configuration elements — provider name, base URL, API completion path, and model ID — are shown directly, and Mirza provides the complete config file and command list in a pinned comment for easy reproduction.
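Mirza's pinned comment has the authoritative config file; purely as a hedged sketch, the four elements he calls out — provider name, base URL, API completion path, and model ID — might map to keys like the ones below. Every key name and value here is an assumption for illustration, not OpenClaw's documented schema.

```json
{
  "provider": "llama-cpp",
  "baseUrl": "http://localhost:8080/v1",
  "apiCompletionPath": "/chat/completions",
  "modelId": "qwen3.5-35b"
}
```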
Once the gateway service is started, the Qwen3.5 35B model becomes accessible through OpenClaw and can be connected to external channels such as Telegram. This makes the tutorial relevant for developers looking to self-host capable open-weight models with a feature-rich agent interface, avoiding recurring inference costs while maintaining full control over their data and infrastructure.
📺 Source: Fahd Mirza · Published February 25, 2026
🏷️ Format: Tutorial Demo
