GPT-5.4 Is Here — I Tested the New ChatGPT Model

GPT-5.4 Is Here — I Tested the New ChatGPT Model

More

Descriptions:

Skill Leap AI tests GPT-5.4 Thinking shortly after its release, walking through the model’s headline capabilities and comparing it to adjacent OpenAI releases and competing frontier models. The video also contextualizes where GPT-5.4 sits relative to GPT-5.3 Instant (the fast, non-reasoning variant released days earlier) and GPT-5.4 Pro (a research-grade tier), explaining why the versioning split between instant and thinking models may appear non-sequential.

Key capabilities demonstrated include native computer use — now built directly into GPT-5.4 rather than requiring a separate agent model — with examples covering data entry, email handling, and calendar management. The creator also tests knowledge work output: a 15-slide PowerPoint generated from a research prompt in roughly five minutes, and a multi-tab Excel spreadsheet with working formulas produced from a single prompt in about ten minutes. On the coding side, GPT-5.4 Thinking is shown matching GPT-5.3 Codex performance in a general-purpose package, while improved tool-calling efficiency reportedly reduces token consumption even at a slightly higher per-token price.

OpenAI’s internal benchmark comparisons against Anthropic’s Opus 4.6 and Google’s Gemini 3.1 Pro show GPT-5.4 Thinking with a marginal edge on select tasks, though the creator notes results are mixed and the comparison excludes non-OpenAI benchmarks. The video concludes with a live website-building demo using the model’s canvas mode, giving developers a practical reference for real-world output quality.


📺 Source: Skill Leap AI · Published March 05, 2026
🏷️ Format: Review

1 Item

Channels