10:54 Tutorials23 hours ago Talkie: I Ran a 1930 AI Model Locally and Talked to People from the Past Fahd Mirza explores Talkie, a 13 billion parameter language model built by Alec Radford — the researcher behind GPT-2 — that was trai... 0 comments 520 views
09:22 Tutorials2 days ago DramaBox – Run Most Expressive TTS with Voice Cloning Locally Fahd Mirza takes a hands-on look at DramaBox, a newly released expressive text-to-speech model that can be run locally on consumer-gr... 0 comments 755 views
11:12 Benchmarks5 days ago Qwen3.6 27B Gets 20% Faster with MTP and llama.cpp Locally Fahd Mirza demonstrates how to enable multi-token prediction (MTP) on Qwen3.6 27B using ik_llama.cpp — a community fork of the popula... 0 comments 3.3K views
09:49 Tutorials5 days ago Building AI Evals for Real-World Problems Fahd Mirza walks through how to set up and run OpenAI's evals framework on a practical real-world task: classifying the causes of inv... 0 comments 337 views
11:00 Tutorials5 days ago NVIDIA Nemotron Elastic: 3-in-1 Elastic LLM Like Russian Dolls in One File NVIDIA's Nemotron Elastic model family packs three reasoning models — 30B, 23B, and 12B parameters — into a single checkpoint file us... 0 comments 1.4K views
09:15 Benchmarks6 days ago ZAYA1-VL-8B: Efficient Open Visual Intelligence – Run Locally Fahd Mirza puts ZAYA1-VL-8B — the new vision-language model from Zeffa — through its paces on an NVIDIA RTX 6000 with 48GB of VRAM, s... 0 comments 733 views
09:56 Tutorials7 days ago Local Deep Research + Ollama – Free AI Research Assistant You Control Fahd Mirza walks through a complete installation and demonstration of Local Deep Research, an open-source AI research assistant that... 0 comments 1.1K views
12:04 Tutorials7 days ago Z-Anime: Natural Anime with Studio Quality in Seconds – Run Locally Fahd Mirza walks through the full local installation and generation workflow for Z-Anime (ZZZ Anime), a newly released anime-focused... 0 comments 533 views
08:43 Tutorials1 week ago DFlash Drafter for Gemma 4 26B – Official Speculative Decoding is Here: Run Locally ZLab, the UC San Diego research team that invented DFlash speculative decoding, has released the first official drafter model paired... 0 comments 507 views
08:17 Coding & Dev Tools1 week ago Build Your Own Voice AI Translation App with OpenAI’s Real-Time Translation Model Fahd Mirza walks through building a live voice translation application using OpenAI's newly released GPT real-time translate model—a... 0 comments 497 views