GPT-5.1

Name: GPT-5.1
Author: OpenAI

OpenAI | Proprietary | Pending Human Review

GPT-5.1 is a frontier-grade multimodal language model family released by OpenAI in November 2025. It introduces a unified system architecture featuring a 'Smart Router' that dynamically allocates compute resources between two primary modes: 'Instant' (optimized for low latency and conversational warmth) and 'Thinking' (optimized for deep, adaptive reasoning). The model utilizes a Sparse Mixture-of-Experts (MoE) architecture with a central language backbone and attachable modules, allowing it to process text, audio, image, and video inputs natively. Key capabilities include adaptive test-time compute, where the model adjusts its reasoning depth based on query complexity, and enhanced personalization options with distinct personality presets. It demonstrates significant improvements in instruction following, coding (via the Codex variants), and mathematical reasoning compared to its predecessor, GPT-5.

Release Date: 2025-11-12

Specifications

Parameters: Undisclosed (Estimated ~1.7-2T)
Architecture: Sparse Mixture-of-Experts (MoE) with Smart Router Network and Adaptive Test-Time Compute
License: Proprietary
Context Window: 400,000 tokens
Max Output: 128,000 tokens
Training Data Cutoff: Sep 2024
Type: multimodal
Modalities: text, image, audio, video
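
The context window and max output figures above can be turned into a simple request budget. The following is a minimal sketch, not an official API: the helper names `max_input_tokens` and `fits` are hypothetical, and the code assumes, for illustration only, that prompt and generated tokens share the single 400,000-token window.

```python
# Limits copied from the Specifications list above.
CONTEXT_WINDOW = 400_000  # total tokens in the context window
MAX_OUTPUT = 128_000      # maximum generated tokens


def max_input_tokens(reserved_output: int = MAX_OUTPUT) -> int:
    """Hypothetical helper: prompt tokens left after reserving room for output.

    Assumption (not confirmed by the source): input and output tokens
    are counted against the same context window.
    """
    if not 0 <= reserved_output <= MAX_OUTPUT:
        raise ValueError("reserved_output must be between 0 and MAX_OUTPUT")
    return CONTEXT_WINDOW - reserved_output


def fits(prompt_tokens: int, reserved_output: int = MAX_OUTPUT) -> bool:
    """Return True if a prompt of this size fits alongside the reserved output."""
    return 0 <= prompt_tokens <= max_input_tokens(reserved_output)


if __name__ == "__main__":
    print(max_input_tokens())       # budget when the full output is reserved
    print(max_input_tokens(8_000))  # budget when only a short reply is needed
    print(fits(300_000, 8_000))
```

Under that assumption, reserving the full 128,000-token output leaves 272,000 tokens for the prompt; reserving less output frees more room for input.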

Advanced Specifications

Model Family: GPT-5
Finetuned From: GPT-5
API Access: Available
Chat Interface: Available
Multilingual Support: Yes
Variants: GPT-5.1 Instant, GPT-5.1 Thinking, GPT-5.1 Pro, GPT-5.1-Codex, GPT-5.1-Codex-Max

Capabilities & Limitations

Capabilities: adaptive reasoning, code generation, mathematical reasoning, multimodal processing, agentic workflows, personality customization, shopping research
Known Limitations: potential hallucinations, tone drift, bias risks, adaptive reasoning latency on complex tasks
Notable Use Cases: coding assistant, complex problem solving, creative writing, document analysis, autonomous agents
Function Calling Support: Yes
Tool Use Support: Yes

Related Models

GPT-5.2

OpenAI

GPT-5.2 is OpenAI's flagship frontier model series designed for professional knowledge work, advanced coding, and agentic workflows. Released in December 2025 as a response to competitive pressures, it features a massive 400,000-token context window and a 128,000-token maximum output capacity. The model utilizes a Mixture-of-Experts (MoE) architecture to balance inference efficiency with deep reasoning capabilities. It is available in three variants—Instant, Thinking, and Pro—each optimized for different points on the latency-intelligence curve. GPT-5.2 demonstrates state-of-the-art performance in tool calling reliability (98.7%), coding (SWE-Bench Verified 80.0%), and long-context retrieval.

Type: multimodal
Parameters: Proprietary
Release Date: 2025-12-11
License: Proprietary

GPT-OSS-120B

OpenAI

GPT-OSS-120B is a state-of-the-art open-weight language model that delivers strong real-world performance at low cost. This 120 billion parameter mixture-of-experts model achieves near-parity with OpenAI o4-mini on core reasoning benchmarks while running efficiently on a single 80 GB GPU. It was trained using reinforcement learning and techniques informed by OpenAI's most advanced internal models, including o3 and other frontier systems.

Type: text
Parameters: 117B total (5.1B active per token)
Release Date: 2025-08-05
License: Open Weights

GPT-OSS-20B

OpenAI

GPT-OSS-20B is a medium-sized open-weight language model that delivers similar results to OpenAI o3‑mini on common benchmarks and can run on edge devices with just 16 GB of memory. This makes it ideal for on-device use cases, local inference, or rapid iteration without costly infrastructure. Despite its smaller size, it demonstrates strong performance on reasoning tasks, tool use, and competition mathematics.

Type: text
Parameters: 21B total (3.6B active per token)
Release Date: 2025-08-05
License: Open Weights
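
The "total" versus "active per token" parameter counts listed for the two GPT-OSS models illustrate how sparse a mixture-of-experts model is at inference time. The short sketch below computes the active fraction from the figures on this page; the function name `active_fraction` is a hypothetical helper introduced only for this illustration.

```python
def active_fraction(total_b: float, active_b: float) -> float:
    """Share of parameters used per token, given counts in billions."""
    if total_b <= 0 or not 0 <= active_b <= total_b:
        raise ValueError("expected 0 <= active_b <= total_b and total_b > 0")
    return active_b / total_b


# (total, active per token) in billions, as listed on this page.
MODELS = {
    "GPT-OSS-120B": (117.0, 5.1),
    "GPT-OSS-20B": (21.0, 3.6),
}

if __name__ == "__main__":
    for name, (total, active) in MODELS.items():
        share = active_fraction(total, active)
        print(f"{name}: {share:.1%} of parameters active per token")
```

With these figures, roughly 4.4% of GPT-OSS-120B's parameters and about 17.1% of GPT-OSS-20B's are active for any single token.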
