METR - Frontier Models

24:44

Foundation Models4 months ago

Wes Roth breaks down what he calls \"the scariest chart in AI development history\" — a METR (Meter Research) benchmark tracking AI a...

42:15

Foundation Models5 months ago

Nate B Jones builds an extended analysis around the \"five levels of AI coding\" framework published by Glowforge CEO Dan Shapiro in...

01:15:52

Foundation Models6 months ago

Joel Becker from METR (Model Evaluation and Threat Research) presents the organization's framework for measuring AI agent task horizo...

21:22

Foundation Models6 months ago

Joel Becker, a researcher at METR (Model Evaluation and Threat Research), presents two empirical studies that together expose a strik...