GPT-5 - Frontier Models

There are 89 items in this page

01:15:52

How METR measures Long Tasks and Experienced Open Source Dev Productivity – Joel Becker, METR

Foundation Models4 months ago

How METR measures Long Tasks and Experienced Open Source Dev Productivity – Joel Becker, METR

Joel Becker from METR (Model Evaluation and Threat Research) presents the organization's framework for measuring AI agent task horizo...

14:54

Claude Code is all you need in 2026

Coding & Dev Tools4 months ago

Claude Code is all you need in 2026

Brian Casel, the creator of the widely-used Agent OS framework, makes a direct case that vanilla Claude Code running on Opus 4.5 is s...

01:21:18

Marc Andreessen’s 2026 Outlook: AI Timelines, US vs. China, and The Price of AI

Interviews4 months ago

Marc Andreessen’s 2026 Outlook: AI Timelines, US vs. China, and The Price of AI

In a wide-ranging AMA-style interview published by a16z in January 2026, Marc Andreessen lays out his framework for understanding whe...

34:56

How I would use an LLM to learn Rust

Tutorials4 months ago

How I would use an LLM to learn Rust

Web Dev Cody walks through his personal methodology for using large language models to learn a programming language he has never touc...

11:31

This Test Was Built to Block AI — GPT-5 Finally Passed It

Foundation Models4 months ago

This Test Was Built to Block AI — GPT-5 Finally Passed It

GPT-5 has crossed the human performance threshold on ARC-AGI 2, a benchmark explicitly designed to resist memorization by testing abs...

27:34

[State of Post-Training] From GPT-4.1 to 5.1: RLVR, Agent & Token Efficiency — Josh McGrath, OpenAI

Interviews5 months ago

[State of Post-Training] From GPT-4.1 to 5.1: RLVR, Agent & Token Efficiency — Josh McGrath, OpenAI

Josh McGrath, a post-training researcher at OpenAI working on thinking models, joins Latent Space for a candid discussion of what has...

25:49

The 5 Most Impactful AI Model Releases of 2025

Business & Strategy5 months ago

The 5 Most Impactful AI Model Releases of 2025

This episode counts down the five most impactful AI model releases of 2025, offering a ranked and argued retrospective of the year's...

27:46

⚡️GPT5-Codex-Max: Training Agents with Personality, Tools & Trust — Brian Fioca + Bill Chen, OpenAI

Interviews5 months ago

⚡️GPT5-Codex-Max: Training Agents with Personality, Tools & Trust — Brian Fioca + Bill Chen, OpenAI

Brian Fioca and Bill Chen from OpenAI join the Latent Space podcast at AI Engineer World's Fair to unpack the training philosophy beh...

24:56

The 10 Biggest AI Stories of 2025

Business & Strategy5 months ago

The 10 Biggest AI Stories of 2025

The AI Daily Brief presents its ten biggest AI stories of 2025 in this comprehensive year-end retrospective. The list opens with the...

22:50

Small Bets, Big Impact Building GenBI at a Fortune 100 – Asaf Bord, Northwestern Mutual

Agents & Automation5 months ago

Small Bets, Big Impact Building GenBI at a Fortune 100 – Asaf Bord, Northwestern Mutual

Asaf Bord, a technology leader at Northwestern Mutual, presents a candid and detailed case study at AI Engineer on building GenBI — a...