Llama 3.3 70B by Meta demonstrates competitive pricing. View detailed benchmark data including scores across coding, math, reasoning, speed, and cost metrics.
General Benchmarks
Coding Benchmarks
Reasoning Benchmarks
Speed Benchmarks
Cost Benchmarks
Context Benchmarks
Llama 3.3 70B — Benchmark Scores Overview
Scores normalized to percentage scale for visual comparison. ELO scores mapped to 0-100 range (1100-1500).
Compare Llama 3.3 70B With
Llama 3.3 70B — Frequently Asked Questions
How intelligent is Llama 3.3 70B?
Llama 3.3 70B scores 1210 on the Chatbot Arena ELO rating, making it an entry-level AI model. This score is based on blind head-to-head human preference voting.
How much does Llama 3.3 70B cost?
Llama 3.3 70B costs $0.10 per 1M input tokens and $0.30 per 1M output tokens. This makes it one of the more affordable models.
How fast is Llama 3.3 70B?
Llama 3.3 70B generates output at 90 tokens per second, which is moderate compared to other models. The time to first token is 300 ms.
How good is Llama 3.3 70B at coding?
Llama 3.3 70B achieves 22.0% on SWE-bench Verified, demonstrating basic real-world software engineering capability. This benchmark tests the model's ability to resolve actual GitHub issues.
How good is Llama 3.3 70B at math and reasoning?
Llama 3.3 70B scores 60.0% on the MATH benchmark (competition-level mathematics). It also achieves 34.0% on GPQA Diamond, a graduate-level science reasoning benchmark.
What is the context window of Llama 3.3 70B?
Llama 3.3 70B has a context window of 131K tokens. This determines how much text, conversation history, and code the model can process in a single request.
Who created Llama 3.3 70B?
Llama 3.3 70B was created by Meta. It is classified as a open source model in the AI Value Index.
Is Llama 3.3 70B open source?
Yes, Llama 3.3 70B is an open-source model. The model weights are publicly available for download and self-hosting.