Running Local AI on AMD

Coding & Dev Tools2 months ago

Running Local AI on AMD

Descriptions:

Sam Witteveen takes a hands-on look at running local AI on an AMD workstation equipped with a Ryzen Threadripper 9980X processor and the Radeon AI Pro R9 700 GPU with 32GB of VRAM. The video addresses a question gaining urgency in 2026: as frontier model costs climb—especially for agentic and reasoning workloads that burn tokens at a rate chat never did—can prosumer AMD hardware serve as a credible alternative to cloud APIs for serious AI work?

The walkthrough covers the full local stack from the ground up. LM Studio and Ollama both now ship with ROCm runtime support and run out of the box on AMD cards, while 32GB of VRAM means Witteveen can load recommended 4-bit or 8-bit quantizations of Qwen 3, Gemma, DeepSeek, and similar open-weight models without significant compromise. He then moves into the developer layer: PyTorch offers official ROCm wheels installable via a single pip command, the Hugging Face Transformers library runs without code changes, and Unsloth now publishes its own guide for fine-tuning LLMs on AMD GPUs—meaning full training, not just inference, is supported.

The core argument is that ROCm—long the weak link in AMD’s AI story—has matured to the point where standard PyTorch workflows mostly just work. For developers weighing privacy, token costs, and the demands of long-running coding agents like Open Claude or Hermes, this video offers a practical, reproducible assessment of what AMD’s current hardware stack can realistically deliver heading into the second half of 2026.

📺 Source: Sam Witteveen · Published May 26, 2026
🏷️ Format: Hands On Build

1 Item

Channels

No Image Available

Sam Witteveen

1 Item

Companies

No Image Available

AMD

Tags

AMD ComfyUI Gemma 4 LM Studio Ollama OpenClaw Qwen 3.6 Unsloth

Prev

The Playbook for a $100M AI Agency

Next

Bonsai Image: The World’s First 1-bit Image Generator — Running Locally

18 Related Posts

Related Posts

14:58

Coding & Dev Tools

The Ultimate Knowledge Base: Bring YouTube Into Your AI Second Brain

1 hour ago

12:23

Coding & Dev Tools

Microsoft Fara1.5 27B: Local Install + Real Browser Automation Demo

1 day ago

23:27

Coding & Dev Tools

I Built a $10,000 Website for $13 (Claude + Higgsfield)

1 day ago

25:27

Coding & Dev Tools

Full Tutorial: From Idea to App with Claude Design and Claude Code in 25 Minutes

1 day ago

09:07

Coding & Dev Tools

Your AI Agent Is Burning Money (Fix It)

1 day ago

09:16

Coding & Dev Tools

DeepSeek V4 Flash Fully Local — 32 tok/s on a Single Chip

3 days ago