Anthropic logo

Claude 3.7 Sonnet

AnthropicProprietaryVerified

Anthropic's most intelligent model to date and the first hybrid reasoning model on the market. Claude 3.7 Sonnet can produce near-instant responses or extended, step-by-step thinking that is made visible to the user. API users have fine-grained control over how long the model can think for (up to 128K tokens). Priced at $3 per million input tokens and $15 per million output tokens at launch, including thinking tokens.

2025-02-24
Decoder-only Transformer
Proprietary

Specifications

Architecture
Decoder-only Transformer
License
Proprietary
Context Window
200,000 tokens
Max Output
128,000 tokens
Training Data Cutoff
Nov 2024
Type
multimodal
Modalities
textimage

Benchmark Scores

American Invitational Mathematics Examination (AIME) 2024 problems....

The FACTS Grounding Leaderboard evaluates LLMs' ability to generate factually accurate long-form res...

GPQA84.8

Graduate-level Problems in Quantitative Analysis (GPQA) evaluates advanced reasoning on graduate-lev...

MATH96.2

A dataset of 12,500 challenging competition mathematics problems requiring multi-step reasoning....

MMLU86.1

Massive Multitask Language Understanding (MMLU) tests knowledge across 57 subjects including mathema...

A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI with 11.5...

Software Engineering Benchmark (SWE-bench) evaluates models on real-world software engineering tasks...

Tool Augmented Understanding Benchmark (TAU-bench) evaluates models on their ability to use tools....

Testing long-term coherence in agents by simulating a vending machine business. Agents manage orderi...

Advanced Specifications

Model Family
Claude
API Access
Available
Chat Interface
Available
Multilingual Support
Yes

Capabilities & Limitations

Capabilities
reasoningcodingextended thinkingfront-end web developmentextended thinking control
Notable Use Cases
complex problem-solvingsoftware developmentresearch assistanceagentic coding
Function Calling Support
Yes
Tool Use Support
Yes

Related Models