Claude 4.5 Haiku
Claude 4.5 Haiku is the fastest and most cost-efficient model in the Claude 4.5 family, designed to deliver near-frontier intelligence with low latency. It is optimized for high-volume tasks, real-time responses, and sub-agent orchestration. Despite its smaller size, it supports advanced features like 'extended thinking' and computer use, making it suitable for parallelized workflows where speed is critical. It is positioned as a drop-in replacement for previous mid-tier models with significantly better economics.
2025-10-15
Unreleased
Decoder-only Transformer
Proprietary
Specifications
- Parameters
- Unreleased
- Architecture
- Decoder-only Transformer
- License
- Proprietary
- Context Window
- 200,000 tokens
- Max Output
- 64,000 tokens
- Training Data Cutoff
- Jul 2025
- Type
- multimodal
- Modalities
- textimage
Benchmark Scores
Advanced Specifications
- Model Family
- Claude
- Finetuned From
- Claude 4
- API Access
- Available
- Chat Interface
- Available
- Multilingual Support
- Yes
- Variants
- claude-haiku-4-5-20251001
Capabilities & Limitations
- Capabilities
- fast reasoningsub-agent orchestrationcodecomputer usevisionmultilingual
- Known Limitations
- less capable in deep theoretical reasoning compared to Opusknowledge cutoff limits real-time data without tools
- Notable Use Cases
- real-time customer supporthigh-volume data extractionparallel sub-agent taskslow-latency coding assistants
- Function Calling Support
- Yes
- Tool Use Support
- Yes