Unsloth - Frontier Models

There are 24 items on this page

35:15

Business & Strategy · 2 weeks ago

New robot waifus, GLM 5.2 craze, AI spas, new world models, new science agents: AI NEWS

AI Search's weekly roundup covers a dense slate of releases spanning open-source models, video generation, robotics, and AI agents. T...

14:01

Tutorials · 3 weeks ago

DiffusionGemma GGUF: Run Google’s Fastest Model Locally on Any GPU

Fahd Mirza demonstrates how to run DiffusionGemma — Google's new diffusion-based text generation model — locally using a quantized GG...

16:47

Research & Benchmarks · 3 weeks ago

Google QAT vs Unsloth QAT + MTP – Which Gemma 4 12B Is Actually Better?

This video pits two quantized versions of Google's Gemma 4 12B against each other in a practical, locally-run benchmark: Google's own...

15:50

Foundation Models · 4 weeks ago

Road to 5 Million Tokens: Breaking Barriers in Long Context Training — Max Ryabinin, Together AI

Max Ryabinin, VP of Research and Development at Together AI, presents the company's research project on extending transformer trainin...

14:35

Benchmarks · 4 weeks ago

Google QAT vs Unsloth Q4_0 – Which Gemma 4 12B Quantization Is Better?

Fahd Mirza runs a controlled comparison between two 4-bit quantized versions of Google's Gemma 4 12B model: Google's own QAT (quantiz...

06:01

Coding & Dev Tools · 4 weeks ago

Run Google’s newest 12B AI on a phone? Yes, it’s possible!

The Alphastack channel walks through a custom cross-platform app that runs Google's Gemma 4 12B multimodal model entirely on-device —...

32:57

Tutorials · 1 month ago

Unsloth Studio is insane… fine-tune any AI model locally

Unsloth Studio is a free, open-source desktop application that brings LLM fine-tuning to consumer hardware — and this video by David...

14:33

Coding & Dev Tools · 1 month ago

Running Local AI on AMD

Sam Witteveen takes a hands-on look at running local AI on an AMD workstation equipped with a Ryzen Threadripper 9980X processor and...

15:31

Coding & Dev Tools · 2 months ago

PFlash + Qwen3.6-27B-DFlash: 10x Faster Prefill on a Single GPU: Run Locally

Fahd Mirza builds and benchmarks PFlash, a prefill acceleration tool that dramatically reduces the blank-screen wait time when feedin...

12:03

Tutorials · 2 months ago

LTX 2.3 – Improved AI Videos & Extensions in ComfyUI!

Nerdy Rodent covers the v1.1 update to LTX 2.3, an open-source AI video model from Lightricks, and demonstrates a full video extensi...