Claude Opus 4 by Anthropic demonstrates strong general intelligence, excellent coding ability. View detailed benchmark data including scores across coding, math, reasoning, speed, and cost metrics.
General Benchmarks
Coding Benchmarks
Reasoning Benchmarks
Speed Benchmarks
Cost Benchmarks
Context Benchmarks
Claude Opus 4 — Benchmark Scores Overview
Scores normalized to percentage scale for visual comparison. ELO scores mapped to 0-100 range (1100-1500).
Compare Claude Opus 4 With
Claude Opus 4 — Frequently Asked Questions
How intelligent is Claude Opus 4?
Claude Opus 4 scores 1375 on the Chatbot Arena ELO rating, making it a high-performing AI model. This score is based on blind head-to-head human preference voting.
How much does Claude Opus 4 cost?
Claude Opus 4 costs $15.0 per 1M input tokens and $75.0 per 1M output tokens. This places it in the premium pricing tier.
How fast is Claude Opus 4?
Claude Opus 4 generates output at 50 tokens per second, which is slower, prioritizing quality over speed compared to other models. The time to first token is 550 ms.
How good is Claude Opus 4 at coding?
Claude Opus 4 achieves 72.5% on SWE-bench Verified, demonstrating excellent real-world software engineering capability. This benchmark tests the model's ability to resolve actual GitHub issues.
How good is Claude Opus 4 at math and reasoning?
Claude Opus 4 scores 86.0% on the MATH benchmark (competition-level mathematics). It also achieves 79.6% on GPQA Diamond, a graduate-level science reasoning benchmark.
What is the context window of Claude Opus 4?
Claude Opus 4 has a context window of 200K tokens. This determines how much text, conversation history, and code the model can process in a single request.
Who created Claude Opus 4?
Claude Opus 4 was created by Anthropic. It is classified as a flagship model in the AI Value Index.
Is Claude Opus 4 open source?
No, Claude Opus 4 is a proprietary model. It is available through Anthropic's API and compatible providers.