AI Models Directory — Benchmark Profiles for 118+ Models

Browse all AI models with benchmark scores, pricing, and performance data. Click any model for detailed analysis.

OpenAI (23 models)

| Model | Tier | Arena ELO | SWE-bench | Input ($/1M) | Output ($/1M) |
|---|---|---|---|---|---|
| GPT-5.2 | Flagship | 1475 | 80.0% | 1.80 | 14.00 |
| GPT-OSS 120B | Open Source | n/a | n/a | n/a | n/a |
| GPT-OSS 20B | Open Source | n/a | n/a | n/a | n/a |
| Sora 2 | Flagship | 1368 | n/a | n/a | n/a |
| GPT-5.1 Codex Mini | Mid-Range | 1310 | 55.0% | 0.25 | 2.00 |
| GPT-5.1 Codex | Flagship | 1395 | 78.0% | 1.30 | 10.00 |
| GPT-5.1 | Flagship | 1464 | 76.3% | 1.30 | 10.00 |
| GPT-5 Nano | Budget | 1200 | 25.0% | 0.05 | 0.40 |
| GPT-5 Mini | Mid-Range | 1300 | 48.0% | 0.25 | 2.00 |
| GPT-5 | Flagship | 1390 | 74.9% | 1.30 | 10.00 |
| o3 Pro | Flagship | 1410 | 70.0% | 20.00 | 80.00 |
| o4 Mini | Mid-Range | 1350 | 68.1% | 1.10 | 4.40 |
| o3 | Flagship | 1380 | 69.1% | 2.00 | 8.00 |
| GPT-4.1 Nano | Budget | 1120 | 18.0% | 0.10 | 0.40 |
| GPT-4.1 Mini | Budget | 1250 | 35.0% | 0.40 | 1.60 |
| GPT-4.1 | Mid-Range | 1340 | 50.0% | 2.00 | 8.00 |
| GPT-4.5 | Flagship | 1310 | 38.0% | 75.00 | 150.00 |
| o3 Mini | Mid-Range | 1320 | 50.0% | 1.10 | 4.40 |
| o1 Mini | Mid-Range | 1300 | 45.0% | 3.00 | 12.00 |
| o1 | Flagship | 1360 | 48.9% | 15.00 | 60.00 |
| GPT-4o Mini | Budget | 1220 | 20.0% | 0.15 | 0.60 |
| GPT-4o | Mid-Range | 1280 | 30.7% | 2.50 | 10.00 |
| DALL-E 3 | Flagship | n/a | n/a | n/a | n/a |

Google (13 models)

Anthropic (11 models)

About the AI Models Directory

The AI Value Index tracks 118+ large language models from leading providers including OpenAI, Google, Anthropic, Qwen, Mistral, and more. Each model profile includes benchmark scores across general intelligence, coding, math, reasoning, speed, and cost metrics.

Models are categorized as Flagship, Mid-Range, Budget, or Open Source based on their capability tier and pricing. Click any model to view its full benchmark profile, use the Compare tool for side-by-side comparisons, or check Pricing for detailed cost analysis.
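As a rough illustration of how the per-1M-token prices listed in the directory translate into per-request cost, the sketch below applies GPT-5.1's listed rates ($1.30 input, $10.00 output per 1M tokens) to a hypothetical request. The token counts and the helper function are assumptions for the example, not part of the AI Value Index itself.

```python
# Illustrative cost arithmetic only; prices come from the directory listing,
# token counts are hypothetical.
def request_cost(input_tokens: int, output_tokens: int,
                 input_price_per_m: float, output_price_per_m: float) -> float:
    """Return the USD cost of one request given per-1M-token prices."""
    return (input_tokens / 1_000_000) * input_price_per_m \
         + (output_tokens / 1_000_000) * output_price_per_m

# GPT-5.1 listed rates: $1.30 per 1M input tokens, $10.00 per 1M output tokens.
cost = request_cost(input_tokens=50_000, output_tokens=5_000,
                    input_price_per_m=1.30, output_price_per_m=10.00)
print(f"${cost:.3f}")  # $0.065 input + $0.050 output = $0.115
```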

Frequently Asked Questions

How many AI models does the AI Value Index track?

The AI Value Index currently tracks 118+ large language models from 8+ leading providers including OpenAI, Anthropic, Google, Meta, DeepSeek, xAI, Qwen, and Mistral. New models are added as they launch.

What is the difference between Flagship, Mid-Range, Budget, and Open Source models?

Flagship models (e.g. GPT-5.2, Claude Opus 4.6) offer peak capability at premium prices. Mid-Range models balance quality and cost. Budget models (e.g. GPT-5 Nano) prioritize low cost for high-volume use. Open Source models (e.g. Llama, Qwen) can be self-hosted and fine-tuned freely.

Which AI provider has the most models?

OpenAI (23 models) and Google (13 models) currently offer the largest lineups, spanning flagship to budget tiers. Anthropic follows with 11 models, Meta and DeepSeek each offer 4-6 models, and xAI, Qwen, and Mistral round out the directory.

How often is the AI models directory updated?

The directory is updated within days of a new model launch or pricing change. Benchmark scores are refreshed as new evaluation results become available from official leaderboards and independent testing platforms.

What data is shown on each model profile?

Each model profile includes Chatbot Arena ELO, SWE-bench Verified, MMLU-Pro, HumanEval, and 20+ other benchmark scores, plus input and output pricing per 1M tokens, output speed, context window size, and provider details.
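For readers who want to work with these fields programmatically, here is a minimal sketch of what a single profile record might look like. The `ModelProfile` structure and its field names are assumptions for illustration only, not the AI Value Index's actual schema; the values are taken from the GPT-5.1 entry in the directory above.

```python
# Hypothetical representation of one model profile; field names are assumed,
# values are copied from the GPT-5.1 row in the directory table.
from dataclasses import dataclass
from typing import Optional

@dataclass
class ModelProfile:
    name: str
    provider: str
    tier: str                      # Flagship, Mid-Range, Budget, or Open Source
    arena_elo: Optional[int]       # Chatbot Arena ELO
    swe_bench: Optional[float]     # SWE-bench Verified, percent
    input_price: Optional[float]   # USD per 1M input tokens
    output_price: Optional[float]  # USD per 1M output tokens

gpt_5_1 = ModelProfile(
    name="GPT-5.1", provider="OpenAI", tier="Flagship",
    arena_elo=1464, swe_bench=76.3, input_price=1.30, output_price=10.00,
)
```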