GPT-4.1

Name: GPT-4.1
Author: OpenAI

OpenAIProprietaryVerified

Flagship GPT model for complex tasks. It is well suited for problem solving across domains. Features major improvements in coding, instruction following, and long context comprehension.

2025-04-14

Decoder-only Transformer (with vision encoder for images)

Proprietary

Compare with other models

Specifications

Architecture: Decoder-only Transformer (with vision encoder for images)
License: Proprietary
Context Window: 1,047,576 tokens
Max Output: 32,768 tokens
Training Data Cutoff: Jun 2024
Type: multimodal
Modalities: textvision

Benchmark Scores

AIME-202448.1

American Invitational Mathematics Examination (AIME) 2024 problems....

AIME-202537

American Invitational Mathematics Examination (AIME) 2025 problems....

CharXiv-Reasoning56.7

Tests reasoning on challenging problems from arXiv papers across multiple scientific domains....

LiveCodeBench v644.7

Benchmark for evaluating LLMs on code generation tasks from contests....

MathVista72.2

Evaluates mathematical reasoning in visual contexts, combining vision and mathematical problem-solvi...

MMLU90.2

Massive Multitask Language Understanding (MMLU) tests knowledge across 57 subjects including mathema...

MMMU74.8

A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI with 11.5...

Multi-IF70.8

Multi-IF evaluates LLMs on multi-turn and multilingual instruction following across 8 languages, wit...

SWE-Lancer35.1

A benchmark of over 1,400 freelance software engineering tasks from Upwork, valued at $1 million USD...

view all (+18)

Advanced Specifications

Model Family: GPT
API Access: Available
Chat Interface: Not Available

Capabilities & Limitations

Capabilities: codemathreasoningfunction callingstructured outputsfine-tuningdistillationpredicted outputsweb searchfile searchimage generationcode interpreterMCPstreamingbatch APIprompt caching
Known Limitations: Does not support audio modalitiesDoes not support computer useNot available in free tier
Notable Use Cases: coding assistantdocument QAagent systemssoftware engineeringextracting insights from large documentscustomer support automationlegal document analysisfrontend development
Function Calling Support: Yes
Tool Use Support: Yes

Resources

Related Models

GPT-5.2

OpenAI

GPT-5.2 is OpenAI's flagship frontier model series designed for professional knowledge work, advanced coding, and agentic workflows. Released in December 2025 as a response to competitive pressures, it features a massive 400,000-token context window and a 128,000-token maximum output capacity. The model utilizes a Mixture-of-Experts (MoE) architecture to balance inference efficiency with deep reasoning capabilities. It is available in three variants—Instant, Thinking, and Pro—each optimized for different points on the latency-intelligence curve. GPT-5.2 demonstrates state-of-the-art performance in tool calling reliability (98.7%), coding (SWE-Bench Verified 80.0%), and long-context retrieval.

Typemultimodal

ParametersProprietary

2025-12-11

Proprietary

Details Compare

GPT-5.1

OpenAI

GPT-5.1 is a frontier-grade multimodal language model family released by OpenAI in November 2025. It introduces a unified system architecture featuring a 'Smart Router' that dynamically allocates compute resources between two primary modes: 'Instant' (optimized for low latency and conversational warmth) and 'Thinking' (optimized for deep, adaptive reasoning). The model utilizes a Sparse Mixture-of-Experts (MoE) architecture with a central language backbone and attachable modules, allowing it to process text, audio, image, and video inputs natively. Key capabilities include adaptive test-time compute, where the model adjusts its reasoning depth based on query complexity, and enhanced personalization options with distinct personality presets. It demonstrates significant improvements in instruction following, coding (via the Codex variants), and mathematical reasoning compared to its predecessor, GPT-5.

Typemultimodal

ParametersUndisclosed (Estimated ~1.7-2T)

2025-11-12

Proprietary

Details Compare

GPT-OSS-120B

OpenAI

GPT-OSS-120B is a state-of-the-art open-weight language model that delivers strong real-world performance at low cost. This 120 billion parameter mixture-of-experts model achieves near-parity with OpenAI o4-mini on core reasoning benchmarks while running efficiently on a single 80 GB GPU. It was trained using reinforcement learning and techniques informed by OpenAI's most advanced internal models, including o3 and other frontier systems.

Typetext

Parameters117B total (5.1B active per token)

2025-08-05

Open Weights

Details Compare