Anthropic logo

Claude 3 Sonnet

AnthropicProprietaryVerified

Anthropic's balanced model that strikes an ideal balance between intelligence and speed for enterprise workloads. Delivers strong performance at a lower cost compared to its peers. Pricing at launch: $3 per million input tokens, $15 per million output tokens.

2024-03-04
Decoder-only Transformer
Proprietary

Specifications

Architecture
Decoder-only Transformer
License
Proprietary
Context Window
200,000 tokens
Max Output
4,096 tokens
Training Data Cutoff
Aug 2023
Type
multimodal
Modalities
textimage

Benchmark Scores

MMLU79

Massive Multitask Language Understanding (MMLU) tests knowledge across 57 subjects including mathema...

GPQA40.4

Graduate-level Problems in Quantitative Analysis (GPQA) evaluates advanced reasoning on graduate-lev...

GSM8K92.3

Grade School Math 8K (GSM8K) consists of 8.5K high-quality grade school math word problems....

MATH43.1

A dataset of 12,500 challenging competition mathematics problems requiring multi-step reasoning....

MGSM83.5

Multilingual Grade School Math (MGSM) extends GSM8K to 10 languages....

HumanEval73

Evaluates code generation capabilities by asking models to complete Python functions based on docstr...

DROP78.9

Discrete Reasoning Over Paragraphs (DROP) requires models to resolve references in a passage and per...

BIG-bench82.9

Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark of 204 diverse tasks....

ARC93.2

AI2 Reasoning Challenge (ARC) tests reasoning through grade-school science questions....

HellaSwag89

Tests common sense natural language inference through completion of scenarios....

See all benchmarks

Advanced Specifications

Model Family
Claude
API Access
Available
Chat Interface
Available
Multilingual Support
Yes

Capabilities & Limitations

Function Calling Support
Yes
Tool Use Support
No

Related Models