35:15 Business & Strategy2 weeks ago New robot waifus, GLM 5.2 craze, AI spas, new world models, new science agents: AI NEWS AI Search's weekly roundup covers a dense slate of releases spanning open-source models, video generation, robotics, and AI agents. T... 0 comments 52.2K views
14:01 Tutorials3 weeks ago DiffusionGemma GGUF: Run Google’s Fastest Model Locally on Any GPU Fahd Mirza demonstrates how to run DiffusionGemma — Google's new diffusion-based text generation model — locally using a quantized GG... 0 comments 4.6K views
16:47 Research & Benchmarks3 weeks ago Google QAT vs Unsloth QAT + MTP – Which Gemma 4 12B Is Actually Better? This video pits two quantized versions of Google's Gemma 4 12B against each other in a practical, locally-run benchmark: Google's own... 0 comments 3K views
15:50 Foundation Models4 weeks ago Road to 5 Million Tokens: Breaking Barriers in Long Context Training — Max Ryabinin, Together AI Max Ryabinin, VP of Research and Development at Together AI, presents the company's research project on extending transformer trainin... 0 comments 698 views
14:35 Benchmarks4 weeks ago Google QAT vs Unsloth Q4_0 – Which Gemma 4 12B Quantization Is Better? Fahd Mirza runs a controlled comparison between two 4-bit quantized versions of Google's Gemma 4 12B model: Google's own QAT (quantiz... 0 comments 3.2K views
06:01 Coding & Dev Tools4 weeks ago Run Google’s newest 12B AI on a phone? Yes, it’s possible! The Alphastack channel walks through a custom cross-platform app that runs Google's Gemma 4 12B multimodal model entirely on-device —... 0 comments 30 views
32:57 Tutorials1 month ago Unsloth Studio is insane… fine-tune any AI model locally Unsloth Studio is a free, open-source desktop application that brings LLM fine-tuning to consumer hardware — and this video by David... 0 comments 8.3K views
14:33 Coding & Dev Tools1 month ago Running Local AI on AMD Sam Witteveen takes a hands-on look at running local AI on an AMD workstation equipped with a Ryzen Threadripper 9980X processor and... 0 comments 1.1K views
15:31 Coding & Dev Tools2 months ago PFlash + Qwen3.6-27B-DFlash: 10x Faster Prefill on a Single GPU: Run Locally Fahd Mirza builds and benchmarks PFlash, a prefill acceleration tool that dramatically reduces the blank-screen wait time when feedin... 0 comments 3.8K views
12:03 Tutorials2 months ago LTX 2.3 – Improved AI Videos & Extensions in ComfyUI! Nerdy Rodent covers the v1.1 update to LTX 2.3, an open-source AI video model from LightTricks, and demonstrates a full video extensi... 0 comments 2.2K views