DeepSeek-R1

Name: DeepSeek-R1
Author: DeepSeek

DeepSeekOpen SourceVerified

DeepSeek-R1 is a first-generation reasoning model trained via large-scale reinforcement learning. Built on DeepSeek-V3-Base, it incorporates cold-start data before RL to address challenges like endless repetition and poor readability found in DeepSeek-R1-Zero. Achieves performance comparable to OpenAI-o1 across math, code, and reasoning tasks through advanced chain-of-thought reasoning capabilities.

2025-01-20

671B (37B activated)

Mixture of Experts

MIT

Compare with other models

Specifications

Parameters: 671B (37B activated)
Architecture: Mixture of Experts
License: MIT
Context Window: 128,000 tokens
Max Output: 32,768 tokens
Type: text
Modalities: text

Benchmark Scores

MMLU90.8

Massive Multitask Language Understanding (MMLU) tests knowledge across 57 subjects including mathema...

MMLU-Pro84

MMLU-Pro is an enhanced benchmark with over 12,000 challenging questions across 14 domains including...

DROP92.2

Discrete Reasoning Over Paragraphs (DROP) requires models to resolve references in a passage and per...

GPQA71.5

Graduate-level Problems in Quantitative Analysis (GPQA) evaluates advanced reasoning on graduate-lev...

SimpleQA30.1

A benchmark of simple but precise questions to test factual knowledge and reasoning....

CodeForces2,029

Advanced competitive programming benchmark for evaluating large language models on algorithmic probl...

SWE-bench49.2

Software Engineering Benchmark (SWE-bench) evaluates models on real-world software engineering tasks...

Aider-Polyglot53.3

Tests models on their ability to write code in multiple programming languages....

AIME-202479.8

American Invitational Mathematics Examination (AIME) 2024 problems....

MATH 50097.3

A sample of 500 diverse problems from the MATH benchmark, spanning topics like probability, algebra,...

view all (+7)

Advanced Specifications

Model Family: R1 series
API Access: Available
Chat Interface: Available
Multilingual Support: Yes
Variants: DeepSeek-R1-Zero (RL without SFT)DeepSeek-R1-Distill-Qwen-1.5BDeepSeek-R1-Distill-Qwen-7BDeepSeek-R1-Distill-Qwen-14BDeepSeek-R1-Distill-Qwen-32BDeepSeek-R1-Distill-Llama-8BDeepSeek-R1-Distill-Llama-70B

Capabilities & Limitations

Capabilities: reasoningchain-of-thoughtself-verificationreflectioncodemathsciencelong-form reasoning
Known Limitations: May bypass thinking pattern for certain queriesRequires temperature 0.5-0.7 to prevent endless repetitionsNo system prompt support recommended
Notable Use Cases: complex mathematical reasoningadvanced coding tasksscientific problem solvingstep-by-step reasoningcompetitive programming
Function Calling Support: No
Tool Use Support: No

Resources

Related Models

Gemini 2.5 Flash-Lite

Google

Google's most cost-efficient and fastest 2.5 model yet. Higher quality than 2.0 Flash-Lite on coding, math, science, reasoning and multimodal benchmarks. Excels at high-volume, latency-sensitive tasks like translation and classification.

Gemini 2.5 Flash

Google

Improved across key benchmarks for reasoning, multimodality, code and long context while getting even more efficient. Best for fast performance on complex tasks.

Gemini 2.5 Pro

Google

Gemini 2.5 Pro is capable of reasoning through its thoughts before responding, resulting in enhanced performance and improved accuracy. Features Deep Think, an enhanced reasoning mode, and native audio outputs that capture subtle nuances of speech.