DeepSeek

DeepSeek R1 — Benchmark Scores, Pricing & Performance Analysis

OPEN SOURCEDeepSeek
Chatbot Arena ELO
1355
Output Speed
35 tok/s
Input Cost
$0.55/1M
Output Cost
$2.2/1M
Context Window
131K

DeepSeek R1 by DeepSeek demonstrates strong general intelligence, outstanding mathematical reasoning. View detailed benchmark data including scores across coding, math, reasoning, speed, and cost metrics.

DeepSeek R1 — Benchmark Scores Overview

Scores normalized to percentage scale for visual comparison. ELO scores mapped to 0-100 range (1100-1500).

DeepSeek R1 — Frequently Asked Questions

How intelligent is DeepSeek R1?

DeepSeek R1 scores 1355 on the Chatbot Arena ELO rating, making it a high-performing AI model. This score is based on blind head-to-head human preference voting.

How much does DeepSeek R1 cost?

DeepSeek R1 costs $0.55 per 1M input tokens and $2.2 per 1M output tokens. This is mid-range pricing for its capability level.

How fast is DeepSeek R1?

DeepSeek R1 generates output at 35 tokens per second, which is slower, prioritizing quality over speed compared to other models. The time to first token is 950 ms.

How good is DeepSeek R1 at coding?

DeepSeek R1 achieves 49.2% on SWE-bench Verified, demonstrating moderate real-world software engineering capability. This benchmark tests the model's ability to resolve actual GitHub issues.

How good is DeepSeek R1 at math and reasoning?

DeepSeek R1 scores 97.3% on the MATH benchmark (competition-level mathematics). It also achieves 71.5% on GPQA Diamond, a graduate-level science reasoning benchmark.

What is the context window of DeepSeek R1?

DeepSeek R1 has a context window of 131K tokens. This determines how much text, conversation history, and code the model can process in a single request.

Who created DeepSeek R1?

DeepSeek R1 was created by DeepSeek. It is classified as a open source model in the AI Value Index.

Is DeepSeek R1 open source?

Yes, DeepSeek R1 is an open-source model. The model weights are publicly available for download and self-hosting.