Anthropic logo

Claude 3.5 Sonnet

AnthropicProprietaryVerified

Claude 3.5 Sonnet raises the industry bar for intelligence, outperforming competitor models and Claude 3 Opus on a wide range of evaluations, with the speed and cost of a mid-tier model. It operates at twice the speed of Claude 3 Opus.

2024-06-20
Decoder-only Transformer
Proprietary

Specifications

Architecture
Decoder-only Transformer
License
Proprietary
Context Window
200,000 tokens
Max Output
8,192 tokens
Training Data Cutoff
Apr 2024
Type
multimodal
Modalities
textimage

Benchmark Scores

Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark of 204 diverse tasks....

DROP87.1

Discrete Reasoning Over Paragraphs (DROP) requires models to resolve references in a passage and per...

The FACTS Grounding Leaderboard evaluates LLMs' ability to generate factually accurate long-form res...

GPQA59.4

Graduate-level Problems in Quantitative Analysis (GPQA) evaluates advanced reasoning on graduate-lev...

GSM8K96.4

Grade School Math 8K (GSM8K) consists of 8.5K high-quality grade school math word problems....

Evaluates code generation capabilities by asking models to complete Python functions based on docstr...

MATH71.1

A dataset of 12,500 challenging competition mathematics problems requiring multi-step reasoning....

MGSM91.6

Multilingual Grade School Math (MGSM) extends GSM8K to 10 languages....

MMLU88.7

Massive Multitask Language Understanding (MMLU) tests knowledge across 57 subjects including mathema...

Testing long-term coherence in agents by simulating a vending machine business. Agents manage orderi...

Advanced Specifications

Model Family
Claude
API Access
Available
Chat Interface
Available
Multilingual Support
Yes

Capabilities & Limitations

Capabilities
advanced reasoningcode generationmathematicsmultilingual problem-solvingtext comprehensionknowledge retrieval
Function Calling Support
Yes
Tool Use Support
Yes

Related Models