Claude Opus 4
Claude Opus 4 is Anthropic's most powerful model and the best coding model in the world, leading on SWE-bench (72.5%) and Terminal-bench (43.2%). It delivers sustained performance on long-running tasks that require focused effort and thousands of steps, with the ability to work continuously for several hours. Features include extended thinking with tool use, parallel tool execution, improved memory capabilities, and significantly reduced shortcut behaviors.
Specifications
- Architecture
- Decoder-only Transformer
- License
- Proprietary
- Context Window
- 200,000 tokens
- Max Output
- 32,000 tokens
- Training Data Cutoff
- Mar 2025
- Type
- multimodal
- Modalities
- textimage
Benchmark Scores
Massive Multitask Language Understanding (MMLU) tests knowledge across 57 subjects including mathema...
Graduate-level Problems in Quantitative Analysis (GPQA) evaluates advanced reasoning on graduate-lev...
Software Engineering Benchmark (SWE-bench) evaluates models on real-world software engineering tasks...
Evaluates models on their ability to use terminal commands to solve system administration tasks....
Massive Multi-discipline Multimodal Understanding (MMMU) evaluates multimodal understanding across 3...
American Invitational Mathematics Examination (AIME) problems test advanced mathematical problem-sol...
Advanced Specifications
- Model Family
- Claude
- API Access
- Available
- Chat Interface
- Available
- Multilingual Support
- Yes
Capabilities & Limitations
- Capabilities
- codingreasoningtool usememoryparallel tool executionextended thinking
- Notable Use Cases
- codingcomplex problem-solvingagent workflowslong-running tasks
- Function Calling Support
- Yes
- Tool Use Support
- Yes