08:17 Coding & Dev Tools1 week ago Build Your Own Voice AI Translation App with OpenAI’s Real-Time Translation Model Fahd Mirza walks through building a live voice translation application using OpenAI's newly released GPT real-time translate model—a... 0 comments 498 views
08:41 Tutorials1 week ago Gemma 4 31B at 196 tok/s with RedHat DFlash Speculator Locally This hands-on tutorial from the Fahd Mirza channel demonstrates running Google's Gemma 4 31B model locally at 196 tokens per second u... 0 comments 2.2K views
11:07 Tutorials1 week ago Build Your First OpenClaw Plugin from Scratch – Custom Tools with Ollama Fahd Mirza walks through a complete OpenClaw plugin setup from scratch, demonstrating how to extend the open-source AI agent platform... 0 comments 429 views
08:57 Benchmarks1 week ago Google Releases Gemma 4 MTP Drafters – Run Locally and DFlash Comparison Fahd Mirza demonstrates Google's newly released MTP (multi-token prediction) draft models for the Gemma 4 family, running live tests... 0 comments 5.2K views
15:38 Research & Benchmarks2 weeks ago Ling-2.6-1T: Open Source Trillion-Parameter AI with 109 Tokens/sec for FREE Fahd Mirza puts Inclusion AI's latest flagship model, Ling-2.6-1T, through a practical hands-on review. The model is a one-trillion-p... 0 comments 1.8K views
08:53 Tutorials2 weeks ago Hermes Agent Now Runs Natively on LM Studio – Full Local AI Agent Setup Fahd Mirza walks through the complete setup of Hermes Agent—an open-source, self-improving AI agent from Nous Research—with its newly... 0 comments 3.8K views
08:34 Tutorials2 weeks ago Semble + OpenCode + Ollama: Local Code Search MCP for AI Agents Fahd Mirza demonstrates Symbol, a code search library designed specifically for AI coding agents, integrated with the OpenCode termin... 0 comments 1.4K views
09:01 Coding & Dev Tools2 weeks ago Running a 27B model at 130 tokens sec on a single GPU Locally with Luce DFlash LlamaDeFlash is a custom inference engine built from scratch in C++ and CUDA — no vLLM, no llama.cpp, no Python in the critical path... 0 comments 7.7K views
11:39 Coding & Dev Tools2 weeks ago Poolside Laguna XS.2: New Open Weight Coding Model Tested Locally with vLLM Poolside AI has released two new open-weight coding models: Laguna M.1 (2–5 billion parameters) and Laguna XS.2, a 33-billion-paramet... 0 comments 1.1K views
12:24 Benchmarks2 weeks ago Mistral Medium 3.5 128B: Built for Long Stretches on Coding: Full Testing Fahd Mirza puts Mistral Medium 3.5 through hands-on testing in this evaluation of the newly released 128-billion-parameter dense mode... 0 comments 2.9K views