24:44 | Foundation Models | 4 months ago | the SCARIEST chart in AI | Wes Roth breaks down what he calls "the scariest chart in AI development history", a METR (Model Evaluation and Threat Research) benchmark tracking AI a... | 0 comments | 80.6K views
42:15 | Foundation Models | 5 months ago | The 5 Levels of AI Coding (Why Most Won’t Make It Past Level 2) | Nate B Jones builds an extended analysis around the "five levels of AI coding" framework published by Glowforge CEO Dan Shapiro in... | 0 comments | 229.1K views
01:15:52 | Foundation Models | 6 months ago | How METR measures Long Tasks and Experienced Open Source Dev Productivity – Joel Becker, METR | Joel Becker from METR (Model Evaluation and Threat Research) presents the organization's framework for measuring AI agent task horizo... | 0 comments | 9.4K views
21:22 | Foundation Models | 6 months ago | Why Agent Hype can fall short of reality – Joel Becker, METR | Joel Becker, a researcher at METR (Model Evaluation and Threat Research), presents two empirical studies that together expose a strik... | 0 comments | 7.5K views