09:01 Coding & Dev Tools 2 weeks ago Running a 27B model at 130 tokens sec on a single GPU Locally with Luce DFlash LlamaDeFlash is a custom inference engine built from scratch in C++ and CUDA: no vLLM, no llama.cpp, no Python in the critical path... 0 comments 7.7K views
10:01 Foundation Models 2 weeks ago The Hidden Engine Behind DeepSeek V4 – DeepEP V2 and TileKernels Explained While most coverage of DeepSeek V4 focuses on benchmark scores, Fahd Mirza goes a level deeper to explain the two open-sourced infras... 0 comments 486 views
10:33 Coding & Dev Tools 3 weeks ago Kimi FlashKDA: 2x Faster AI Prefill — Installed, Explained and Tested Locally Fahd Mirza walks through the live installation of Flash KDA, Moonshot AI's open-source CUDA kernel that accelerates the prefill phase... 0 comments 1.3K views
01:43:13 Interviews 1 month ago Jensen Huang – TPU competition, why we should sell chips to China, & Nvidia’s supply chain moat Dwarkesh Patel sits down with Nvidia CEO Jensen Huang for one of the most substantive interviews the company's founder has given on N... 0 comments 15.1K views
16:23 Tutorials 1 month ago 3 Steps to Train Perfect LTX 2.3 Video LoRAs | How to Train Custom LTX 2.3 LoRAs (Video + Audio!) Veteran AI presents a detailed three-part guide to training custom character LoRAs for LTX Video 2.3, covering dataset preparation, t... 0 comments 2.1K views
01:06:06 Interviews 2 months ago Jensen Huang: Nvidia’s Future, Physical AI, Rise of the Agent, Inference Explosion, AI PR Crisis In a sit-down interview at Nvidia's GTC conference, CEO Jensen Huang joined the All-In Podcast to explain the company's strategic tra... 0 comments 452K views
14:00 Coding & Dev Tools 3 months ago I Coded an AI Waifu that Controls My PC (Claude Cowork + ThreeJS) Alpha Stack walks through building 'Claudina,' a 3D animated desktop companion that integrates with Claude Code's MCP server to give... 0 comments 478 views
12:15 Research & Benchmarks 3 months ago Voicebox: Free ElevenLabs Alternative – Runs Locally on Windows CPU Fahd Mirza installs and stress-tests Voicebox, a free open-source voice cloning and text-to-speech application, on a CPU-only Windows... 0 comments 8.1K views
01:24:45 Interviews 3 months ago Forward Future Live | 02.06.26 | Guests from Modular, Emergence Capital, & Axiom This episode of Forward Future Live brings together Matthew Berman, co-host Nick Wentz, and three guests to break down the biggest AI... 0 comments 3.6K views
13:28 Tutorials 4 months ago Voice Cloning is Dead? Welcome to AI “Voice Design” (Qwen3 TTS) | Qwen3 TTS Full Tutorial Qwen3 TTS goes beyond traditional voice cloning to offer a three-function voice synthesis platform: voice design (generating voices f... 0 comments 3.5K views