27:10 Interviews4 months ago The End of SWE-Bench Verified — Mia Glaese & Olivia Watkins, OpenAI Frontier Evals Mia Glaese, VP of Research at OpenAI overseeing the Codex, human data, and alignment teams, and Olivia Watkins from OpenAI's Frontier... 0 comments 3.8K views
18:50 Foundation Models4 months ago Gemini 3.1 Pro and the Downfall of Benchmarks: Welcome to the Vibe Era of AI AI Explained delivers a technically dense analysis of Gemini 3.1 Pro following its February 2026 release, framing the broader argumen... 0 comments 107.1K views
21:43 Research & Benchmarks4 months ago Google wins again. Gemini 3.1 Pro review Google's Gemini 3.1 Pro is reviewed across an extensive range of capability tests, accessible via the Gemini app by selecting the Pro... 0 comments 122.9K views
14:41 Benchmarks5 months ago GEMINI 3.1 PRO is the new era… Wes Roth reviews Google DeepMind's newly released Gemini 3.1 Pro, the core reasoning model powering the Gemini ecosystem, with a syst... 0 comments 52.4K views
11:50 Foundation Models5 months ago Qwen 3.5 – The next NEXT model Sam Witteveen breaks down Qwen 3.5, the latest flagship from Alibaba's Qwen team, a 397-billion-parameter mixture-of-experts model wi... 0 comments 24K views
35:32 Interviews5 months ago Full Tutorial: The Most Underrated AI Agent for Coding and Product Work | Eno Reyes (Factory) Eno Reyes, co-founder of Factory AI, joins Peter Yang for a live demo and candid conversation about Droid — Factory's enterprise-grad... 0 comments 13.3K views
33:45 Tutorials5 months ago New #1 open source AI model just dropped GLM-5 from Zhipu AI, available for free at z.ai, is reviewed here as a top-performing open-source language model with a chat interfac... 0 comments 97.5K views
28:06 Business & Strategy5 months ago The AI Wake-Up Call Everyone Needs Right Now! Matt Wolfe's video is anchored by a viral article from Matt Schumer, CEO of Hyperight, which argues that AI displacement of knowledge... 0 comments 180.4K views
53:08 Coding & Dev Tools5 months ago DIY dev tools: How this engineer created “Flowy” to visualize his plans and accelerate coding CJ Hess, a software engineer known for sharing advanced AI workflows on X (formerly Twitter), joins How I AI host Claire Vo to demo F... 0 comments 11.1K views
19:50 Research & Benchmarks5 months ago The Two Best AI Models/Enemies Just Got Released Simultaneously Philip from AI Explained spent under 24 hours reading nearly 250 pages of system cards and running hundreds of tests after Claude Opu... 0 comments 80.1K views