Description:
Sam Witteveen reviews Mistral’s first major model release in five months: a four-model suite headlined by Mistral Large 3, a 675-billion-parameter mixture-of-experts model with 41 billion active parameters. Notably, that active-parameter ratio, roughly 6%, is significantly higher than in recent MoE releases from OpenAI and Qwen, which typically keep active parameters well under 5% of the total. Alongside the flagship, Mistral has shipped three dense “Mini Mistral” models at 3B, 8B, and 14B parameters, each released with base weights, an instruction-tuned variant, and a reasoning version, making this one of the more complete open-weight drops in recent months.
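As a quick sanity check on that ratio, here is a minimal sketch using only the parameter counts quoted above; the 5% figure is the rough ceiling cited for the other MoE families.

```python
# Active-parameter ratio for Mistral Large 3, using the counts quoted above.
total_params = 675e9    # total parameters (mixture-of-experts)
active_params = 41e9    # parameters active per forward pass

ratio = active_params / total_params
print(f"active ratio: {ratio:.1%}")           # -> active ratio: 6.1%
print(f"above ~5% MoE norm: {ratio > 0.05}")  # -> above ~5% MoE norm: True
```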
On benchmarks, Mistral Large 3 sits roughly on par with DeepSeek 3.1 and Kimi K2 in Mistral’s own comparisons, though Witteveen flags those comparisons as selective—the model ranks 28th overall on the LLM Arena leaderboard, a far cry from frontier closed-model performance. Filtered to Apache 2.0-licensed models, however, it sits near the top, edging out several Qwen 3 variants. The Mini models show competitive results against same-size Gemma and Qwen offerings, with strong instruction-following scores at the 14B tier. Mistral also signals a reasoning version of Large 3 is forthcoming.
Witteveen frames the release within a broader ecosystem gap: few organizations still release multi-size model families with base weights, leaving Mistral as one of the few alternatives to Qwen for practitioners who need fine-tunable open models across a range of sizes. For teams building on open infrastructure, this release meaningfully expands the available options.
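For those practitioners, the practical upshot is that every size ships with base weights you can fine-tune. Below is a minimal sketch of loading one such checkpoint with Hugging Face transformers; the model ID is a placeholder, not confirmed by the video, so check Mistral’s Hugging Face organization for the actual repository names.

```python
# Minimal sketch: loading a base-weight checkpoint for fine-tuning with
# Hugging Face transformers. The model ID below is hypothetical.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "mistralai/Mini-Mistral-8B-Base"  # placeholder repo name

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto")

# Base weights, rather than the instruction-tuned or reasoning variants,
# are the usual starting point for supervised fine-tuning on your own data.
```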
📺 Source: Sam Witteveen · Published December 03, 2025
🏷️ Format: Review
