Cursor just beat EVERYONE.

Research & Benchmarks2 months ago

Cursor just beat EVERYONE.

Descriptions:

Matthew Berman reviews Cursor’s newly released Composer 2.5, the latest in-house coding model from Cursor built on the Kimi open-source model family. Despite being positioned as a minor incremental release, Berman argues the model is a meaningful inflection point: on Cursor’s internal CursorBench evaluation — which plots models on axes of cost per task versus benchmark score — Composer 2.5 achieves performance close to frontier models at an estimated $0.50 per task, compared to roughly $11 per task for Anthropic’s Opus 4.7 and approximately $4 for OpenAI’s GPT 5.5 Extra High. The model is currently exclusive to the Cursor IDE and is not available through any external API.

Berman contextualizes the release against Google’s simultaneous launch of Gemini 3.5 Flash, arguing that a broader convergence is underway around what he calls “workhorse-class” models — fast, cheap, and capable enough for the majority of production coding tasks. The video includes a segment from his conversation with Google CEO Sundar Pichai at Google IO, in which Pichai explains that serving billions of users across Search and Gemini makes cost efficiency a strategic necessity, not just an optimization.

The central thesis is that raw benchmark rankings matter less than price-performance ratio for most real-world deployments. Berman argues that the vast majority of coding use cases don’t require absolute frontier performance, and that Composer 2.5’s cost profile makes it the practical default choice for developers and teams operating under realistic budget constraints.

📺 Source: Matthew Berman · Published May 26, 2026
🏷️ Format: Review

1 Item

Channels

No Image Available

Matthew Berman

2 Items

Companies

No Image Available

Anthropic

No Image Available

XAI

Tags

Aaron Levie Anthropic Claude Opus 4.7 Composer 2.5 Cursor Elon Musk Gemini Flash 3.5 Google GPT-55 Matthew Berman Moonshot AI OpenAI Sundar Pichai xAI

Prev

The Playbook for a $100M AI Agency

Next

Bonsai Image: The World’s First 1-bit Image Generator — Running Locally

18 Related Posts

Related Posts

08:11

Research & Benchmarks

Inflect Micro v2 – A Complete Voice AI Under 10M Parameters on CPU

2 days ago

38:44

Research & Benchmarks

Jack Dorsey’s Buzz: The New Hermes Agent?

2 days ago

32:44

Research & Benchmarks

Claude Opus 5 is a freak

3 days ago

12:06

Research & Benchmarks

Microsoft Mage-Flow: Image Generation and Editing Locally

3 days ago

10:56

Research & Benchmarks

Claude Chat vs Cowork vs Code: Which One Should You Use?

3 days ago

13:36

Research & Benchmarks

JoyAI Image Edit Plus in ComfyUI – How Does it Compare?

4 days ago