Descriptions:
In this video, I look at VibeCoder 3b and how it is beating some models that are 300x its size on certain benchmarks by improving its reasoning and chain of thought to be better for specific use cases. While the model is not for production it shows what could be done with these techniques.
Thanks to Dell for Sponsoring the Compute
#DellProPrecision #DellProMax
Paper: https://arxiv.org/abs/2606.16140
Weights: https://huggingface.co/WeiboAI/VibeThinker-3B
Github: https://github.com/WeiboAI/VibeThinker
Twitter: https://x.com/Sam_Witteveen
🕵️ Interested in building LLM Agents? Fill out the form below
Building LLM Agents Form: https://drp.li/dIMes
👨💻Github:
https://github.com/samwit/llm-tutorials
⏱️Time Stamps:
00:00 Intro
01:16 VibeThinker-3B
03:33 Benchmarks
05:16 VibeThinker-3B Paper
05:46 Architecture
09:00 Demo







