Skip to main content
OpenAI

Flagship models

Our frontier models are designed to spend more time thinking before producing a response, making them ideal for complex, multi-step problems.

GPT-5.2

The best model for coding and agentic tasks across industries

Price

Input:
$1.750 / 1M tokens
Cached input:
$0.175 / 1M tokens
Output:
$14.000 / 1M tokens

GPT-5.2 pro

The smartest and most precise model

Price

Input:
$21.00 / 1M tokens
Cached input:
-
Output:
$168.00 / 1M tokens

GPT-5 mini

A faster, cheaper version of GPT-5 for well-defined tasks

Price

Input:
$0.250 / 1M tokens
Cached input:
$0.025 / 1M tokens
Output:
$2.000 / 1M tokens

Pricing reflects standard processing rates. To optimize cost and performance for different use cases, we also offer:

Fine-tuning our models

Customize our models to get even higher performance for your specific use cases.

GPT-4.1

Fine-tuning price

Input:
$3.00 / 1M tokens
Cached input:
$0.75 / 1M tokens
Output:
$12.00 / 1M tokens
Training:
$25.00 / 1M tokens

GPT-4.1 mini

Fine-tuning price

Input:
$0.80 / 1M tokens
Cached input:
$0.20 / 1M tokens
Output:
$3.20 / 1M tokens
Training:
$5.00 / 1M tokens

GPT-4.1 nano

Fine-tuning price

Input:
$0.20 / 1M tokens
Cached input:
$0.05 / 1M tokens
Output:
$0.80 / 1M tokens
Training:
$1.50 / 1M tokens

o4-mini

Reinforcement fine-tuning price

Input:
$4.00 / 1M tokens
Cached input:
$1.00 / 1M tokens
Output:
$16.00 / 1M tokens
Training:
$100.00 / training hour

Our APIs

Realtime API

Build low-latency, multimodal experiences including speech-to-speech.

gpt-realtime
$4.00 / 1M input tokens
$0.40 / 1M cached input tokens
$16.00 / 1M output tokens
gpt-realtime-mini
$0.60 / 1M input tokens
$0.06 / 1M cached input tokens
$2.40 / 1M output tokens

Sora Video API

Richly detailed, dynamic video generation and remixing with our latest generative model.

ModelsSizePrice per second
sora-2
Portrait: 720 x 1280
Landscape: 1280 x 720
$0.10
sora-2-pro
Portrait: 720 x 1280
Landscape: 1280 x 720
$0.30
sora-2-pro
Portrait: 1024 x 1792
Landscape: 1792 x 1024
$0.50

Image Generation API

Precise, high-fidelity image generation and editing with our latest multimodal model.

GPT-image-1
$5.00 / 1M input tokens
$1.25 / 1M cached input tokens*
-
GPT-image-1-mini
$2.00 / 1M input tokens
$0.20 / 1M cached input tokens*
-
Prompts are billed similarly to other GPT models. Image outputs cost approximately $0.01 (low), $0.04 (medium), and $0.17 (high) for square images.
*available via the Responses API
For detailed token usage by image quality and size, see the docs.

Responses API

Our newest API combining the simplicity of Chat Completions with the built-in tool use of Assistants.

Price
Responses API is not priced separately. Tokens are billed at the chosen language model’s input and output rates.

Chat Completions API

Build text-based conversational experiences.

Price
Chat Completions API is not priced separately. Tokens are billed at the chosen language model's input and output rates.

Assistants API

Build assistant-like experiences with our tools.

Price
Assistants API is not priced separately. Tokens are billed at the chosen language model's input and output rates.

Built-in tools

Extend model capabilities with built-in tools in the API Platform.

Code Interpreter
$0.03 / session
File Search Storage
$0.10 / GB of vector storage per day (first GB free)
File Search Tool Call (Responses API only)
$2.50 / 1k tool calls
Web Search Tool Call
There are two components that contribute to the cost of using the web search tool: (1) Tool calls and (2) Search content tokens.
  • Tool calls are billed per 1,000 calls, according to the tool version and model type.
  • Search content tokens are tokens retrieved from the search index and fed to the model alongside your prompt to generate an answer. These are billed at the model’s input token rate, unless otherwise specified.
Tool VersionCost
Web search (all models)$10.00 / 1K calls + search content tokens billed at model rates1
Web search preview (reasoning models)$10.00 / 1K calls + search content tokens billed at model rates
Web search preview (non-reasoning models)$25.00 / 1K calls + search content tokens are free
1For gpt-4o-mini and gpt-4.1-mini with the web search non-preview tool, search content tokens are charged as a fixed block of 8,000 input tokens per call.
The billing dashboard will report gpt-4.1 and gpt-4.1-mini search line items as ‘web search tool calls | gpt-4o’ and ‘web search tool calls | gpt-4o-mini’
GB refers to binary gigabytes of storage (also known as gibibyte), where 1GB is 2^30 bytes.

AgentKit

Build, deploy, and optimize production-grade agents with Agent Builder, ChatKit, and Evals.

Billing
Begins on November 1, 2025 — no charges will apply before then.
Usage meter
Storage for ChatKit File / Image Uploads.
Free tier (per account, per month)
1 GB
Price beyond free tier
$0.10 / GB-day
What’s always free
Agent Builder – design and iterate with zero cost until you hit Run. Self-hosted ChatKit – host a custom ChatKit backend and pay only normal model-token charges. Enterprise controls – SSO, RBAC, and audit logs are included at no additional fee.

Explore our offerings for Enterprise customers: Priority processing, Scale Tier and Reserved Capacity.

FAQ

We recommend that developers use our large and mini GPT models for everyday tasks. Our large GPT models generally perform better on a wide range of tasks, while our mini GPT models are fast and inexpensive for simpler tasks.

Our large and mini reasoning models are ideal for complex, multi-step tasks and STEM use cases that require deep thinking about tough problems. You can choose the mini reasoning model if you're looking for a faster, more inexpensive option.

We recommend experimenting with all of these models in the Playground⁠(opens in a new window) to explore which models provide the best price performance trade-off for your usage.

Start creating with OpenAI’s powerful models.