Qwen2.5-Math 72B
A specialized model for mathematical reasoning, supporting Chain-of-Thought (CoT) and Tool-Integrated Reasoning (TIR). It achieves state-of-the-art performance among open-weight models on math benchmarks. It supports bilingual (English/Chinese) math reasoning.
2024-09-19
72B
Decoder-only Transformer
Qwen License
Specifications
- Parameters
- 72B
- Architecture
- Decoder-only Transformer
- License
- Qwen License
- Context Window
- 4,096 tokens
- Max Output
- 4,096 tokens
- Training Data Cutoff
- Sep 2024
- Type
- text
- Modalities
- text
Benchmark Scores
Advanced Specifications
- Model Family
- Qwen
- Finetuned From
- Qwen2.5-72B
- API Access
- Not Available
- Chat Interface
- Not Available
- Multilingual Support
- Yes
- Variants
- Qwen2.5-Math-1.5BQwen2.5-Math-7B
- Hardware Support
- CUDA
Capabilities & Limitations
- Capabilities
- mathematicschain-of-thoughttool-integrated reasoningbilingual math reasoning
- Notable Use Cases
- Math problem solvingScientific reasoning
- Function Calling Support
- Yes
- Tool Use Support
- Yes