o1-preview
Large reasoning model with strong capabilities for solving hard problems through extended thinking. Designed to spend more time reasoning before responding, with enhanced performance in science, coding, and math.
Specifications
- Architecture
- Decoder-only Transformer
- License
- Proprietary
- Context Window
- 128,000 tokens
- Max Output
- 32,768 tokens
- Training Data Cutoff
- Sep 30, 2023
- Type
- text
- Modalities
- text
Benchmark Scores
American Invitational Mathematics Examination (AIME) problems test advanced mathematical problem-sol...
Evaluates models on competitive programming problems from the Codeforces platform....
Evaluates code generation capabilities by asking models to complete Python functions based on docstr...
Evaluates models on their ability to solve cybersecurity challenges across various domains including...
Massive Multitask Language Understanding (MMLU) tests knowledge across 57 subjects including mathema...
Graduate-level Problems in Quantitative Analysis (GPQA) evaluates advanced reasoning on graduate-lev...
A sample of 500 diverse problems from the MATH benchmark, spanning topics like probability, algebra,...
Advanced Specifications
- Model Family
- o-series
- API Access
- Available
- Chat Interface
- Available
- Multilingual Support
- Yes
Capabilities & Limitations
- Capabilities
- reasoningcodemathscience
- Known Limitations
- no function callingno structured outputsno streamingno system messages
- Notable Use Cases
- complex reasoningscientific problem solvingadvanced mathematicscomplex coding tasks
- Function Calling Support
- No
- Tool Use Support
- No