Flagship models
Our frontier models are designed to spend more time thinking before producing a response, making them ideal for complex, multi-step problems.
GPT-5.2
The best model for coding and agentic tasks across industries
Price
$1.750 / 1M tokens
$0.175 / 1M tokens
$14.000 / 1M tokens
GPT-5.2 pro
The smartest and most precise model
Price
$21.00 / 1M tokens
-
$168.00 / 1M tokens
GPT-5 mini
A faster, cheaper version of GPT-5 for well-defined tasks
Price
$0.250 / 1M tokens
$0.025 / 1M tokens
$2.000 / 1M tokens
Pricing reflects standard processing rates. To optimize cost and performance for different use cases, we also offer:
- Batch API(opens in a new window): Save 50% on inputs and outputs with the Batch API and run tasks asynchronously over 24 hours.
- Priority processing: offers reliable, high-speed performance with the flexibility to pay-as-you-go.
Fine-tuning our models
Customize our models to get even higher performance for your specific use cases.
GPT-4.1
Fine-tuning price
$3.00 / 1M tokens
$0.75 / 1M tokens
$12.00 / 1M tokens
$25.00 / 1M tokens
GPT-4.1 mini
Fine-tuning price
$0.80 / 1M tokens
$0.20 / 1M tokens
$3.20 / 1M tokens
$5.00 / 1M tokens
GPT-4.1 nano
Fine-tuning price
$0.20 / 1M tokens
$0.05 / 1M tokens
$0.80 / 1M tokens
$1.50 / 1M tokens
o4-mini
Reinforcement fine-tuning price
$4.00 / 1M tokens
$1.00 / 1M tokens
$16.00 / 1M tokens
$100.00 / training hour
Our APIs
Realtime API
Build low-latency, multimodal experiences including speech-to-speech.
Sora Video API
Richly detailed, dynamic video generation and remixing with our latest generative model.
| Models | Size | Price per second |
|---|---|---|
| sora-2 | Portrait: 720 x 1280 Landscape: 1280 x 720 | $0.10 |
| sora-2-pro | Portrait: 720 x 1280 Landscape: 1280 x 720 | $0.30 |
| sora-2-pro | Portrait: 1024 x 1792 Landscape: 1792 x 1024 | $0.50 |
Image Generation API
Precise, high-fidelity image generation and editing with our latest multimodal model.
Responses API
Our newest API combining the simplicity of Chat Completions with the built-in tool use of Assistants.
Chat Completions API
Build text-based conversational experiences.
Assistants API
Build assistant-like experiences with our tools.
Built-in tools
Extend model capabilities with built-in tools in the API Platform.
- Tool calls are billed per 1,000 calls, according to the tool version and model type.
- Search content tokens are tokens retrieved from the search index and fed to the model alongside your prompt to generate an answer. These are billed at the model’s input token rate, unless otherwise specified.
| Tool Version | Cost |
|---|---|
| Web search (all models) | $10.00 / 1K calls + search content tokens billed at model rates1 |
| Web search preview (reasoning models) | $10.00 / 1K calls + search content tokens billed at model rates |
| Web search preview (non-reasoning models) | $25.00 / 1K calls + search content tokens are free |
AgentKit
Build, deploy, and optimize production-grade agents with Agent Builder, ChatKit, and Evals.
Explore our offerings for Enterprise customers: Priority processing, Scale Tier and Reserved Capacity.
FAQ
We recommend that developers use our large and mini GPT models for everyday tasks. Our large GPT models generally perform better on a wide range of tasks, while our mini GPT models are fast and inexpensive for simpler tasks.
Our large and mini reasoning models are ideal for complex, multi-step tasks and STEM use cases that require deep thinking about tough problems. You can choose the mini reasoning model if you're looking for a faster, more inexpensive option.
We recommend experimenting with all of these models in the Playground(opens in a new window) to explore which models provide the best price performance trade-off for your usage.