Claude 3.7 Sonnet
Anthropic's most intelligent model to date and the first hybrid reasoning model on the market. Claude 3.7 Sonnet can produce near-instant responses or extended, step-by-step thinking that is made visible to the user. API users have fine-grained control over how long the model can think for (up to 128K tokens). Priced at $3 per million input tokens and $15 per million output tokens at launch, including thinking tokens.
Specifications
- Architecture
- Decoder-only Transformer
- License
- Proprietary
- Context Window
- 200,000 tokens
- Max Output
- 128,000 tokens
- Training Data Cutoff
- Nov 2024
- Type
- multimodal
- Modalities
- textimage
Benchmark Scores
Graduate-level Problems in Quantitative Analysis (GPQA) evaluates advanced reasoning on graduate-lev...
Software Engineering Benchmark (SWE-bench) evaluates models on real-world software engineering tasks...
Tool Augmented Understanding Benchmark (TAU-bench) evaluates models on their ability to use tools....
Massive Multitask Language Understanding (MMLU) tests knowledge across 57 subjects including mathema...
Massive Multi-discipline Multimodal Understanding (MMMU) evaluates multimodal understanding across 3...
A dataset of 12,500 challenging competition mathematics problems requiring multi-step reasoning....
American Invitational Mathematics Examination (AIME) 2024 problems....
Advanced Specifications
- Model Family
- Claude
- API Access
- Available
- Chat Interface
- Available
- Multilingual Support
- Yes
Capabilities & Limitations
- Capabilities
- reasoningcodingextended thinkingfront-end web developmentextended thinking control
- Notable Use Cases
- complex problem-solvingsoftware developmentresearch assistanceagentic coding
- Function Calling Support
- Yes
- Tool Use Support
- Yes