GPT 5.4 is so cracked

GPT 5.4 is so cracked

More

Descriptions:

The AI Search channel puts OpenAI’s newly released GPT 5.4 through a rigorous set of real-world capability tests, going well beyond simple prompts to probe the model’s limits across coding, creative composition, medical imaging, and document analysis. All major demos are run inside OpenAI’s Codex coding agent, which works across entire multi-file projects rather than single-file outputs.

Standout tests include building a fully interactive 3D digital twin of Earth with seamless zoom from orbit to street level — achieved in just three or four iterative prompts — and composing a 32-bar piano piece described as notably more musically complex than outputs from competing models like Gemini 3.1 and GLM5. The video also covers GPT 5.4’s multimodal capabilities: the model is asked to identify and annotate lesions in CT scan imagery, and separately to synthesize earnings reports from Google, Nvidia, and Amazon into a single formatted PDF with charts, growth forecasts, and analyst recommendations after 17 minutes of extended thinking.

The reviewer notes that while GPT 5.4 leads on reasoning-heavy and multimodal tasks, it lags behind some competitors in front-end design quality. The extended thinking mode (set to “extra high” reasoning effort) is used consistently throughout, giving a clear sense of the model’s top-end performance ceiling across diverse domains.


📺 Source: AI Search · Published March 07, 2026
🏷️ Format: Review

1 Item

Channels

1 Item

Companies