Alibaba logo

Qwen2.5-Math 72B

AlibabaOpen WeightsPending Human Review

A specialized model for mathematical reasoning, supporting Chain-of-Thought (CoT) and Tool-Integrated Reasoning (TIR). It achieves state-of-the-art performance among open-weight models on math benchmarks. It supports bilingual (English/Chinese) math reasoning.

2024-09-19
72B
Decoder-only Transformer
Qwen License

Specifications

Parameters
72B
Architecture
Decoder-only Transformer
License
Qwen License
Context Window
4,096 tokens
Max Output
4,096 tokens
Training Data Cutoff
Sep 2024
Type
text
Modalities
text

Benchmark Scores

Advanced Specifications

Model Family
Qwen
Finetuned From
Qwen2.5-72B
API Access
Not Available
Chat Interface
Not Available
Multilingual Support
Yes
Variants
Qwen2.5-Math-1.5BQwen2.5-Math-7B
Hardware Support
CUDA

Capabilities & Limitations

Capabilities
mathematicschain-of-thoughttool-integrated reasoningbilingual math reasoning
Notable Use Cases
Math problem solvingScientific reasoning
Function Calling Support
Yes
Tool Use Support
Yes

Related Models