GPT-5.1 — Benchmark Scores, Pricing & Performance Analysis

Tier: Flagship
Provider: OpenAI
Chatbot Arena ELO: 1464
Output Speed: 95 tok/s
Input Cost: $1.3/1M tokens
Output Cost: $10.0/1M tokens
Context Window: 400K tokens

GPT-5.1 by OpenAI demonstrates top-tier general intelligence, excellent coding ability, and outstanding mathematical reasoning. View detailed benchmark data, including scores across coding, math, reasoning, speed, and cost metrics.

GPT-5.1 — Benchmark Scores Overview

Scores are normalized to a percentage scale for visual comparison. ELO scores in the 1100-1500 range are mapped to 0-100.
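The ELO mapping above can be sketched as a linear min-max rescale. The page does not state the exact formula, so the linear form, the clamping of out-of-range values, and the function name `elo_to_percent` are all assumptions made for illustration.

```python
def elo_to_percent(elo, lo=1100.0, hi=1500.0):
    """Map an ELO rating to a 0-100 scale.

    Assumption: the mapping is linear between lo and hi, and ratings
    outside that range are clamped. The source page does not confirm this.
    """
    pct = (elo - lo) * 100.0 / (hi - lo)
    return max(0.0, min(100.0, pct))


# GPT-5.1's listed Chatbot Arena ELO
print(elo_to_percent(1464))  # 91.0
```

Under these assumptions, the listed rating of 1464 would land at 91 on the chart's percentage scale.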

GPT-5.1 — Frequently Asked Questions

How intelligent is GPT-5.1?

GPT-5.1 scores 1464 on the Chatbot Arena ELO rating, making it one of the top-ranked AI models. This score is based on blind head-to-head human preference voting.

How much does GPT-5.1 cost?

GPT-5.1 costs $1.3 per 1M input tokens and $10.0 per 1M output tokens. This is mid-range pricing for its capability level.
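As a rough illustration of how the per-token prices translate into a per-request cost, here is a minimal sketch. It uses only the two listed prices. The function name `request_cost_usd` and the example token counts are hypothetical, and real billing may include factors such as cached-input discounts that this page does not cover.

```python
# Listed prices from this page, in USD per 1M tokens
INPUT_USD_PER_M = 1.3
OUTPUT_USD_PER_M = 10.0


def request_cost_usd(input_tokens, output_tokens):
    """Estimate the cost of one request from the listed per-token prices."""
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


# Hypothetical request: 10,000 input tokens and 1,000 output tokens
print(round(request_cost_usd(10_000, 1_000), 4))  # 0.023
```

In this example the input and output sides cost roughly the same (about $0.013 and $0.010), even though the request has ten times more input tokens than output tokens.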

How fast is GPT-5.1?

GPT-5.1 generates output at 95 tokens per second, which is moderate compared to other models. The time to first token is 350 ms.
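The two speed figures can be combined into a rough end-to-end estimate for a response. This sketch assumes generation runs at a constant 95 tokens per second once the first token arrives, and that the throughput figure does not already include the time to first token. Neither assumption is confirmed by the page, and `estimate_latency_s` is a hypothetical helper.

```python
TTFT_S = 0.350          # listed time to first token, in seconds
TOKENS_PER_SECOND = 95  # listed output speed


def estimate_latency_s(output_tokens):
    """Rough response time: time to first token plus steady-state generation.

    Assumption: constant throughput after the first token.
    """
    return TTFT_S + output_tokens / TOKENS_PER_SECOND


# Hypothetical 500-token response
print(f"{estimate_latency_s(500):.2f} s")
```

For longer responses the generation term dominates, so the 350 ms startup delay matters mostly for short replies.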

How good is GPT-5.1 at coding?

GPT-5.1 achieves 76.3% on SWE-bench Verified, demonstrating excellent real-world software engineering capability. This benchmark tests the model's ability to resolve actual GitHub issues.

How good is GPT-5.1 at math and reasoning?

GPT-5.1 scores 95.0% on the MATH benchmark (competition-level mathematics). It also achieves 88.1% on GPQA Diamond, a graduate-level science reasoning benchmark.

What is the context window of GPT-5.1?

GPT-5.1 has a context window of 400K tokens. This determines how much text, conversation history, and code the model can process in a single request.
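A simple pre-flight check against the context window might look like the sketch below. It assumes that "400K" means 400,000 tokens and that prompt tokens and generated tokens share the same window. The page states neither, and `fits_context` is a hypothetical helper, not part of any SDK.

```python
CONTEXT_WINDOW_TOKENS = 400_000  # assumption: "400K" read as 400,000 tokens


def fits_context(prompt_tokens, max_output_tokens, window=CONTEXT_WINDOW_TOKENS):
    """Return True if the prompt plus the reserved output budget fits the window.

    Assumption: input and output tokens count against one shared window.
    """
    return prompt_tokens + max_output_tokens <= window


print(fits_context(300_000, 50_000))  # True
print(fits_context(390_000, 20_000))  # False
```

Token counts depend on the tokenizer, so in practice the prompt size would be measured with the provider's own tokenizer rather than estimated from character counts.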

Who created GPT-5.1?

GPT-5.1 was created by OpenAI. It is classified as a flagship model in the AI Value Index.

Is GPT-5.1 open source?

No, GPT-5.1 is a proprietary model. It is available through OpenAI's API and compatible providers.