Claude 3.5 Sonnet
Claude 3.5 Sonnet raises the industry bar for intelligence, outperforming competitor models and Claude 3 Opus on a wide range of evaluations, with the speed and cost of a mid-tier model. It operates at twice the speed of Claude 3 Opus.
Specifications
- Architecture
- Decoder-only Transformer
- License
- Proprietary
- Context Window
- 200,000 tokens
- Max Output
- 8,192 tokens
- Training Data Cutoff
- Apr 2024
- Type
- multimodal
- Modalities
- textimage
Benchmark Scores
Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark of 204 diverse tasks....
Discrete Reasoning Over Paragraphs (DROP) requires models to resolve references in a passage and per...
The FACTS Grounding Leaderboard evaluates LLMs' ability to generate factually accurate long-form res...
Graduate-level Problems in Quantitative Analysis (GPQA) evaluates advanced reasoning on graduate-lev...
Grade School Math 8K (GSM8K) consists of 8.5K high-quality grade school math word problems....
Evaluates code generation capabilities by asking models to complete Python functions based on docstr...
A dataset of 12,500 challenging competition mathematics problems requiring multi-step reasoning....
Massive Multitask Language Understanding (MMLU) tests knowledge across 57 subjects including mathema...
Testing long-term coherence in agents by simulating a vending machine business. Agents manage orderi...
Advanced Specifications
- Model Family
- Claude
- API Access
- Available
- Chat Interface
- Available
- Multilingual Support
- Yes
Capabilities & Limitations
- Capabilities
- advanced reasoningcode generationmathematicsmultilingual problem-solvingtext comprehensionknowledge retrieval
- Function Calling Support
- Yes
- Tool Use Support
- Yes