Description:
Yegor Denisov-Blanch from Stanford presents findings from a two-year, large-scale study tracking AI’s impact on software engineering productivity across real enterprise teams. The research uses a machine learning model trained to replicate panels of 10–15 independent human expert evaluators — scoring code commits on implementation time, maintainability, and complexity — enabling measurement at scale without manual review bottlenecks.
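For a concrete sense of that methodology, here is a minimal Python sketch of panel-style commit scoring. The three dimensions mirror those named in the talk; the field scales and the median-based aggregation are illustrative assumptions, not the study's actual model.

```python
# Minimal sketch of panel-style commit scoring (assumptions, not the study's model).
from dataclasses import dataclass
from statistics import median


@dataclass
class CommitScore:
    implementation_time_hours: float  # estimated effort to implement the change
    maintainability: float            # assumed 0-10 scale, higher is better
    complexity: float                 # assumed 0-10 scale, higher is more complex


def panel_estimate(evaluator_scores: list[CommitScore]) -> CommitScore:
    """Aggregate 10-15 independent evaluations into one panel-level estimate
    by taking the per-dimension median (a robustness assumption)."""
    return CommitScore(
        implementation_time_hours=median(s.implementation_time_hours for s in evaluator_scores),
        maintainability=median(s.maintainability for s in evaluator_scores),
        complexity=median(s.complexity for s in evaluator_scores),
    )
```

The trained model stands in for this panel at scale, so every commit can be scored without routing it through human reviewers.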
Key findings: a matched comparison of 46 AI-using teams against 46 non-AI teams shows a median 10% productivity lift as of mid-2025, with a widening gap between top and bottom performers. Critically, raw AI token usage correlates weakly with outcomes (R²=0.20), while a composite “environment cleanliness index” measuring test coverage, type annotations, documentation, modularity, and code quality correlates far more strongly (R²=0.40). Teams with messy codebases see poor returns even with heavy AI usage — and unchecked AI usage can accelerate codebase entropy.
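A rough illustration of the two measurements above, assuming the cleanliness index is an equal-weighted mean of its five components and that R² comes from a simple one-variable linear fit; both are assumptions for illustration, since the talk does not spell out the exact formulas.

```python
# Illustrative composite index and R-squared comparison (synthetic setup, not study data).
import numpy as np


def cleanliness_index(test_coverage: float, type_annotations: float,
                      documentation: float, modularity: float,
                      code_quality: float) -> float:
    """Each component is assumed normalized to [0, 1]; equal weights are assumed."""
    return float(np.mean([test_coverage, type_annotations, documentation,
                          modularity, code_quality]))


def r_squared(x: np.ndarray, y: np.ndarray) -> float:
    """R-squared of a one-variable least-squares fit of y on x."""
    slope, intercept = np.polyfit(x, y, 1)
    residuals = y - (slope * x + intercept)
    ss_res = np.sum(residuals ** 2)
    ss_tot = np.sum((y - y.mean()) ** 2)
    return 1.0 - ss_res / ss_tot


# Usage with hypothetical per-team arrays:
#   r_squared(token_usage, productivity_lift)        # weak fit; talk reports ~0.20
#   r_squared(cleanliness_scores, productivity_lift) # stronger; talk reports ~0.40
```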
Denisov-Blanch also introduces an open-source AI practices benchmark with five maturity levels ranging from zero AI use to full agentic orchestration, and illustrates how two business units with identical AI tool access and licensing showed dramatically different adoption rates and outcomes. The talk offers a data-driven framework for enterprise leaders who need to measure AI ROI in engineering, move beyond vanity metrics like token spend, and identify which cohort their organization is in before the productivity gap widens further.
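One possible encoding of that five-level scale is sketched below; only the endpoints (zero AI use and full agentic orchestration) are named in the talk, so the intermediate labels are placeholder assumptions.

```python
# Hypothetical encoding of the five-level AI practices maturity scale.
from enum import IntEnum


class AIMaturityLevel(IntEnum):
    NO_AI_USE = 0                    # named in the talk
    ASSISTED_AUTOCOMPLETE = 1        # assumed intermediate stage
    CHAT_DRIVEN_DEVELOPMENT = 2      # assumed intermediate stage
    SUPERVISED_AGENTS = 3            # assumed intermediate stage
    FULL_AGENTIC_ORCHESTRATION = 4   # named in the talk
```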
📺 Source: AI Engineer · Published December 11, 2025
🏷️ Format: Benchmark Test
