Claude Sonnet 4
Claude Sonnet 4 significantly improves on Sonnet 3.7's capabilities, excelling in coding with a state-of-the-art 72.7% on SWE-bench. The model balances performance and efficiency, with enhanced steerability for greater control over implementations. Features include extended thinking with tool use, parallel tool execution, improved instruction following, and significantly reduced shortcut behaviors.
Specifications
- Architecture
- Decoder-only Transformer
- License
- Proprietary
- Context Window
- 200,000 tokens
- Max Output
- 64,000 tokens
- Training Data Cutoff
- Mar 2025
- Type
- multimodal
- Modalities
- textimage
Benchmark Scores
Massive Multitask Language Understanding (MMLU) tests knowledge across 57 subjects including mathema...
Graduate-level Problems in Quantitative Analysis (GPQA) evaluates advanced reasoning on graduate-lev...
Software Engineering Benchmark (SWE-bench) evaluates models on real-world software engineering tasks...
Massive Multi-discipline Multimodal Understanding (MMMU) evaluates multimodal understanding across 3...
American Invitational Mathematics Examination (AIME) problems test advanced mathematical problem-sol...
Advanced Specifications
- Model Family
- Claude
- API Access
- Available
- Chat Interface
- Available
- Multilingual Support
- Yes
Capabilities & Limitations
- Capabilities
- codingreasoningtool useparallel tool executionextended thinking
- Notable Use Cases
- codingsoftware developmentagentic scenariosmulti-feature app development
- Function Calling Support
- Yes
- Tool Use Support
- Yes