Mistral 3: Europe’s Answer to DeepSeek or Too Little, Too Late?


Sam Witteveen reviews Mistral’s first major model release in five months: a four-model suite headlined by Mistral Large 3, a 675-billion-parameter mixture-of-experts model with 41 billion active parameters. Notably, this active-parameter ratio (roughly 6%) is significantly higher than in recent MoE releases from OpenAI and Qwen, which typically keep active parameters well under 5% of total. Alongside the flagship model, Mistral has shipped three dense “Mini Mistral” models at 3B, 8B, and 14B parameter sizes—each released with base weights, instruction-tuned variants, and reasoning versions, making this one of the more complete open-weight drops of the period.
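The active-parameter comparison above is simple arithmetic; a minimal sketch makes it concrete. The Mistral Large 3 figures come from the review itself, while the sub-5% comparison point is the rough range the review attributes to other recent MoE releases (check each model card for exact values):

```python
def active_ratio(active_params_b: float, total_params_b: float) -> float:
    """Fraction of parameters active per token in a mixture-of-experts model."""
    return active_params_b / total_params_b

# Mistral Large 3, per the review: 41B active out of 675B total.
mistral_large_3 = active_ratio(41, 675)
print(f"Mistral Large 3 active ratio: {mistral_large_3:.1%}")  # about 6.1%

# Other recent MoE releases, per the review, sit well under 5% active.
print(mistral_large_3 > 0.05)  # True
```

A higher active ratio means more compute per token for the same total parameter count, which is one way a model can trade inference cost for capability.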

On benchmarks, Mistral Large 3 sits roughly on par with DeepSeek 3.1 and Kimi K2 in Mistral’s own comparisons, though Witteveen flags those comparisons as selective—the model ranks 28th overall on the LLM Arena leaderboard, a far cry from frontier closed-model performance. Filtered to Apache 2.0-licensed models, however, it sits near the top, edging out several Qwen 3 variants. The Mini models show competitive results against same-size Gemma and Qwen offerings, with strong instruction-following scores at the 14B tier. Mistral also signals that a reasoning version of Large 3 is forthcoming.

Witteveen frames the release within a broader ecosystem gap: few organizations are still releasing multi-size model families with base weights, leaving Mistral as one of the only alternatives to Qwen for practitioners who need fine-tuneable open models across a range of sizes. For teams building on open infrastructure, this release meaningfully expands the available options.


📺 Source: Sam Witteveen · Published December 03, 2025
🏷️ Format: Review