Mistral AI's 2026 Blitz: Six Launches in Two Weeks

Between March 16 and March 31, 2026, Mistral AI shipped more product than most companies ship in a quarter. A unified reasoning model, an open-weight text-to-speech system, a formal proof agent, an enterprise training platform, a developer CLI, and a founding role in NVIDIA's Nemotron Coalition — all in 15 days.

This isn't a company treading water. With ARR hitting $400 million in January 2026 (up from ~$20 million a year earlier, per CEO Arthur Mensch) and a $13.8 billion valuation, the Paris-based startup is executing at a pace that makes its "European OpenAI" label feel less like hype and more like understatement.

Here's every model, product, and milestone — verified from primary sources.

Mistral Small 4: One Model to Replace Three

Released March 16, 2026, Mistral Small 4 merges three previously separate products into a single model: Magistral (reasoning), Pixtral (multimodal vision), and Devstral (agentic coding).

Architecture & Specs

| Spec | Detail |
| --- | --- |
| Total Parameters | 119B |
| Active Parameters | ~6B per token (8B with embeddings/output layers) |
| Architecture | Mixture of Experts — 128 experts, 4 active per token |
| Context Window | 256K tokens |
| Input | Text + Image (multimodal) |
| Output | Text |
| License | Apache 2.0 (fully open) |
| Model ID | mistral-small-2603 |
| Self-Hosting | 4x H100, 2x H200, or 1x DGX B200 (minimum) |

What Makes It Different

The killer feature is configurable reasoning. Set reasoning_effort="none" for fast, lightweight chat (equivalent to Small 3.2). Set it to "high" for deep, step-by-step reasoning that matches Magistral. One model, one deployment, adjustable on the fly.
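The toggle is just part of the request. A minimal sketch, assuming Mistral's OpenAI-compatible chat completions endpoint and a top-level reasoning_effort field (the field name and the "low"/"medium" levels are assumptions; the announcement only cites "none" and "high"):

```python
import json

API_URL = "https://api.mistral.ai/v1/chat/completions"  # assumed OpenAI-compatible endpoint

def build_request(prompt: str, effort: str = "none") -> dict:
    """Build a chat-completion payload; `reasoning_effort` sets reasoning depth."""
    assert effort in {"none", "low", "medium", "high"}  # "low"/"medium" are assumed levels
    return {
        "model": "mistral-small-2603",
        "messages": [{"role": "user", "content": prompt}],
        "reasoning_effort": effort,  # assumed field name, per the launch description
    }

# Fast chat path (Small 3.2-equivalent):
fast = build_request("Summarize this ticket in one line.", effort="none")
# Deep reasoning path (Magistral-equivalent):
deep = build_request("Prove that the loop terminates.", effort="high")
print(json.dumps(fast, indent=2))
```

The same deployment serves both requests; only the effort value changes.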

Despite having 5x more total parameters than Small 3, only 6B are active per token — making it dramatically cheaper to run. Mistral claims a 40% reduction in end-to-end completion time and 3x more requests per second compared to Small 3.

Pricing

| Metric | Mistral Small 4 | GPT-5.4 Mini (comparison) |
| --- | --- | --- |
| Input | $0.15/M tokens | $0.75/M tokens |
| Output | $0.60/M tokens | $4.50/M tokens |
| Blended (3:1) | $0.26/M tokens | |

At $0.15 per million input tokens, Mistral Small 4 is among the cheapest multimodal reasoning models available — 5x cheaper than GPT-5.4 Mini on input and 7.5x cheaper on output. (Note: Gemini Flash-Lite at $0.075/M and GPT-4.1 Nano at $0.10/M are cheaper but lack configurable reasoning depth.)
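The blended figure is simply a 3:1 weighted average of the input and output prices, which is easy to verify:

```python
def blended_price(input_per_m: float, output_per_m: float, ratio: int = 3) -> float:
    """Blended $/M tokens, assuming `ratio` input tokens per output token."""
    return (ratio * input_per_m + output_per_m) / (ratio + 1)

print(round(blended_price(0.15, 0.60), 2))  # Mistral Small 4 -> 0.26
print(round(blended_price(0.75, 4.50), 2))  # GPT-5.4 Mini   -> 1.69
```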

Benchmarks

  • Artificial Analysis Intelligence Index: 27 (ranked #6 out of 58 in its class; median is 15)

  • Output Speed: 137.3 tokens/second (median: 79.1 t/s)

  • Time to First Token: 0.97 seconds

  • Mistral claims Small 4 outperforms GPT-OSS 120B on LiveCodeBench while producing 20% less output

The tradeoffs: a 256K context window (not 400K or 1M), no computer use, and benchmark scores below GPT-5.4 and Claude Opus 4.6. But at 5–7.5x lower cost than GPT-5.4 Mini, it sits in a different tier with surprisingly close performance.

Mistral Large 3: The Open-Source Heavyweight

Released December 2, 2025, and still the flagship in 2026, Mistral Large 3 is the largest open-weight MoE model from a major lab.

Specs

| Spec | Detail |
| --- | --- |
| Total Parameters | 675B (including 2.5B vision encoder) |
| Active Parameters | 41B per token |
| Architecture | Sparse Mixture of Experts |
| Context Window | 256K tokens |
| Training | 3,000 NVIDIA H200 GPUs |
| Input | Text + Image |
| License | Apache 2.0 |
| Languages | 40+ native languages |

Pricing

| Metric | Price |
| --- | --- |
| Input | $0.50/M tokens |
| Output | $1.50/M tokens |
| Blended (3:1) | $0.75/M tokens |

Benchmarks

  • MMLU-Pro: 73.11% (LayerLens/Atlas independent evaluation)

  • MATH-500: 93.60% (LayerLens/Atlas independent evaluation)

  • LMArena: #2 in open-source non-reasoning models

  • Artificial Analysis Intelligence Index: 23 (#16 out of 36 open-weight models)

Note: Mistral did not publish official AIME or GPQA Diamond scores for Large 3. As a non-reasoning model, independent evaluations show it scores significantly lower on reasoning-heavy benchmarks (~40% AIME 2025, ~44% GPQA Diamond per Atlas). The 85% AIME and 71.2% GPQA scores belong to the Ministral 14B reasoning variant (see below).

Where it falls short: Proprietary models like Gemini 3 Pro (91.9% GPQA Diamond), GPT-5.4, and Claude Opus 4.6 still hold clear leads on the hardest reasoning benchmarks. Large 3's output speed (~38 t/s per Artificial Analysis) is notably slow for its class — the tradeoff for 675B total parameters.

Ministral 3: The Small Siblings

Released alongside Large 3 in December 2025, the Ministral 3 family includes dense models at 14B, 8B, and 3B parameters. All Apache 2.0. The 14B reasoning variant hits 85% on AIME 2025 — beating Qwen-14B (73.7%) — making it one of the best small reasoning models available.

Voxtral TTS: Open-Weight Text-to-Speech

Released March 23, 2026, Voxtral TTS is Mistral's first audio model — and a direct shot at ElevenLabs.

Architecture

| Component | Parameters |
| --- | --- |
| Autoregressive Decoder | 3.4B |
| Acoustic Flow-Matching Module | 390M |
| Neural Audio Codec | 300M |
| Total | ~4.1B |

Voxtral TTS is built on Ministral 3B; the default BF16 weights are 8 GB, runnable on a single GPU with 16 GB+ VRAM. With quantization, the footprint drops to as low as 3 GB, making it viable for edge devices.
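Those sizes line up with simple parameter-count arithmetic: BF16 stores 16 bits per weight, and roughly 6-bit quantization lands at the 3 GB figure (the 6-bit assumption is mine; Mistral does not specify the scheme):

```python
def weights_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the weights in GB: 1e9 params x bits / 8 bits-per-byte / 1e9."""
    return params_billion * bits_per_weight / 8

print(round(weights_gb(4.1, 16), 1))  # BF16 -> 8.2 GB (matches the ~8 GB default weights)
print(round(weights_gb(4.1, 6), 1))   # ~6-bit quantization -> 3.1 GB (assumed scheme)
```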

Capabilities

  • Languages: 9 (English, French, German, Spanish, Dutch, Portuguese, Italian, Hindi, Arabic)

  • Voice Cloning: Zero-shot with as little as 3 seconds of reference audio

  • Cross-Lingual: Generate English speech with a French voice prompt (and vice versa)

  • Latency: 70ms model latency on H200 (community reports ~90ms real-world TTFA)

  • Real-Time Factor: ~9.7x

  • Max Generation: Up to 2 minutes of audio

  • Preset Voices: 20 built-in

  • Output Formats: WAV, PCM, FLAC, MP3, AAC, Opus (24 kHz)
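A real-time factor of ~9.7x means the model synthesizes 9.7 seconds of audio per second of compute, so even a maximum-length clip generates quickly. A back-of-the-envelope check (ignoring the ~70 ms startup latency):

```python
def generation_seconds(audio_seconds: float, rtf: float = 9.7) -> float:
    """Compute time needed to synthesize audio at a given real-time factor."""
    return audio_seconds / rtf

print(round(generation_seconds(120), 1))  # max 2-minute clip -> ~12.4 s of compute
```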

Pricing

$0.016 per 1,000 characters via API — roughly 73% cheaper than ElevenLabs Flash v2.5.
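At $0.016 per 1,000 characters, long-form generation stays cheap. A rough cost sketch (the character counts are illustrative assumptions, not from the announcement):

```python
def voxtral_cost_usd(num_chars: int, rate_per_1k: float = 0.016) -> float:
    """API cost at $0.016 per 1,000 characters."""
    return num_chars / 1000 * rate_per_1k

print(round(voxtral_cost_usd(5_000), 2))    # a short article (~5k chars) -> 0.08
print(round(voxtral_cost_usd(500_000), 2))  # a full audiobook (~500k chars) -> 8.0
```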

Benchmarks vs. ElevenLabs

  • 68.4% win rate against ElevenLabs Flash v2.5 in multilingual voice cloning evaluations

  • Parity or higher scores against ElevenLabs v3 in speaker similarity

  • Widest gaps in Hindi (~80% preference) and Spanish (~88% preference)

License: Open weights on HuggingFace under CC BY-NC 4.0 (non-commercial). Commercial use requires a separate agreement with Mistral. This is the only major Mistral release in 2026 that isn't Apache 2.0.

Leanstral: Formal Proofs for Vibe Coding

Released March 16, 2026, Leanstral is the first open-source AI agent designed specifically for Lean 4 formal proof engineering. The pitch: instead of generating code you have to test, Leanstral generates both the code and a machine-checkable mathematical proof that it's correct.

Specs

| Spec | Detail |
| --- | --- |
| Total Parameters | 119B (same architecture as Small 4) |
| Active Parameters | ~6.5B per token |
| Architecture | Sparse MoE — 128 experts, 4 active per token |
| License | Apache 2.0 |
| Model ID | labs-leanstral-2603 |

FLTEval Benchmark

| Model | Pass@2 | Pass@16 | Cost to Run |
| --- | --- | --- | --- |
| Leanstral | 26.3 | 31.9 | $36 |
| Claude Sonnet 4.6 | 23.7 | 23.9 | $549 |
| Claude Opus 4.6 | Best quality | | $1,650 |

Leanstral beats Sonnet by 8 points at pass@16 while costing 15x less. Opus still leads on quality, but at roughly 46x Leanstral's cost ($1,650 vs. $36). Mistral claims Leanstral also outperforms GLM-5 (744B), Kimi K2.5 (1T), and Qwen 3.5 (397B) on FLTEval.

Mistral Forge: Enterprise AI Training Platform

Announced March 17, 2026 at NVIDIA GTC, Forge is Mistral's enterprise play — a platform that lets companies build frontier-grade AI models trained on their own proprietary data.

What It Does

Unlike fine-tuning (which adjusts a small fraction of weights) or RAG (which retrieves external context), Forge supports the full training lifecycle:

  • Pre-training on massive internal datasets

  • Post-training refinement

  • Reinforcement learning to align outputs with company policies

  • Supports both dense and MoE architectures

The centerpiece is Mistral Vibe, an autonomous agent that manages hyperparameter search, synthetic data generation, job scheduling, and model evaluation.

Launch Partners

  • ASML — Semiconductor manufacturing optimization (also led Mistral's Series C round)

  • Ericsson — 5G and 5G Advanced network management

  • European Space Agency — Space data applications

  • Reply — Italian consulting firm

  • DSO and HTX — Singapore government agencies

Pricing

License fee for the platform, with optional fees for data pipeline services and forward-deployed engineers. Organizations running training on their own GPU clusters are not charged for compute.

Spaces CLI: Built for Humans and Agents

Released March 31, 2026, Spaces started as an internal platform tool at Mistral and went public when the team realized AI coding agents needed the same tooling as human developers.

In three commands, you go from nothing to a running multi-service project with hot reload, a database, and generated Dockerfiles. Every interactive input has a flag equivalent — so an AI agent can operate the same CLI end-to-end without human intervention.

In one demonstration, an agent configured a fresh repository for deployment, set up CI pipelines, and deployed to production in under 10 minutes with zero hand-holding.

NVIDIA Nemotron Coalition

On March 16, 2026, Mistral joined the NVIDIA Nemotron Coalition as a founding member — contributing large-scale model development and multimodal capabilities. The coalition's first deliverable will be Nemotron 4, an open model co-developed by Mistral and NVIDIA. Other founding members include Cursor, LangChain, Perplexity, Reflection AI (Mira Murati), Black Forest Labs, Sarvam, and Thinking Machines Lab. Mistral Small 4 is available on NVIDIA NIM and build.nvidia.com.

The Full 2026 Product Timeline

| Date | Product | Category | License |
| --- | --- | --- | --- |
| March (exact date TBD) | Mistral Moderation 2603 | Safety | |
| March 16 | Mistral Small 4 | LLM | Apache 2.0 |
| March 16 | Leanstral | Code Agent | Apache 2.0 |
| March 16 | NVIDIA Nemotron Coalition | Partnership | |
| March 17 | Mistral Forge | Enterprise Platform | License fee |
| March 23 | Voxtral TTS | Text-to-Speech | CC BY-NC 4.0 |
| March 31 | Spaces CLI | Developer Tools | |

Mistral by the Numbers (2026)

| Metric | Value |
| --- | --- |
| Valuation | $13.8 billion (Sep 2025) |
| Total Funding | $3.05 billion over 8 rounds |
| Latest Round | $830M debt (March 2026) |
| ARR (Jan 2026) | $400 million |
| ARR Growth | ~$20M (early 2025, per CEO) → ~$312M (Dec 2025, Sacra estimate) → $400M+ (Jan 2026, CEO confirmed) |
| Revenue Target | $1.2 billion (€1B) by end of 2026 |
| Employees | 862 (Feb 2026) |
| Acquisition | Koyeb (Feb 2026) — French serverless cloud platform |
| Data Center | €1.2B partnership with EcoDataCenter in Borlänge, Sweden (campus: 250 MW first phase, opening 2027) |
| Founded | April 2023 by Arthur Mensch, Guillaume Lample, Timothée Lacroix |
| HQ | Paris, France |

What's Coming Next

Mistral teased a reasoning variant of Large 3 when the base model launched in December 2025. As of April 2026, it hasn't shipped — but with Small 4's configurable reasoning and Leanstral's formal proof capabilities already live, the reasoning Large 3 is likely imminent.

No new model has been announced for April 2026 yet. But given Mistral's cadence — 6 products in 15 days during March — it would be surprising if nothing drops before the month is over.

The Bottom Line

Mistral AI is no longer just "the European alternative." With Mistral Small 4 at $0.15/M input (among the cheapest multimodal reasoning models available), Voxtral TTS beating ElevenLabs at 73% less cost, and Forge landing ASML and ESA as enterprise customers — the company is building a full-stack AI platform that competes on both price and capability.

The Apache 2.0 licensing on nearly everything (except Voxtral TTS) remains the key differentiator. In a world where OpenAI, Google, and Anthropic keep their weights locked, Mistral is betting that open > closed — and the revenue numbers suggest they might be right.
