API Pricing

Flagship models

Our frontier models are designed to spend more time thinking before producing a response, making them ideal for complex, multi-step problems.

GPT-5.2

The best model for coding and agentic tasks across industries

Price

Input:
$1.750 / 1M tokens

Cached input:
$0.175 / 1M tokens

Output:
$14.000 / 1M tokens

GPT-5.2 pro

The smartest and most precise model

Price

Input:
$21.00 / 1M tokens

Cached input:
-

Output:
$168.00 / 1M tokens

GPT-5 mini

A faster, cheaper version of GPT-5 for well-defined tasks

Price

Input:
$0.250 / 1M tokens

Cached input:
$0.025 / 1M tokens

Output:
$2.000 / 1M tokens

Pricing reflects standard processing rates. To optimize cost and performance for different use cases, we also offer:

Batch API⁠⁠(opens in a new window): Save 50% on inputs and outputs with the Batch API and run tasks asynchronously over 24 hours.
Priority processing⁠⁠: offers reliable, high-speed performance with the flexibility to pay-as-you-go.

Fine-tuning our models

Customize our models to get even higher performance for your specific use cases.

GPT-4.1

Fine-tuning price

Input:
$3.00 / 1M tokens

Cached input:
$0.75 / 1M tokens

Output:
$12.00 / 1M tokens

Training:
$25.00 / 1M tokens

GPT-4.1 mini

Fine-tuning price

Input:
$0.80 / 1M tokens

Cached input:
$0.20 / 1M tokens

Output:
$3.20 / 1M tokens

Training:
$5.00 / 1M tokens

GPT-4.1 nano

Fine-tuning price

Input:
$0.20 / 1M tokens

Cached input:
$0.05 / 1M tokens

Output:
$0.80 / 1M tokens

Training:
$1.50 / 1M tokens

o4-mini

Reinforcement fine-tuning price

Input:
$4.00 / 1M tokens

Cached input:
$1.00 / 1M tokens

Output:
$16.00 / 1M tokens

Training:
$100.00 / training hour

Explore detailed pricing(opens in a new window)

Our APIs

Realtime API

Build low-latency, multimodal experiences including speech-to-speech.

gpt-realtime

$4.00 / 1M input tokens

$0.40 / 1M cached input tokens

$16.00 / 1M output tokens

gpt-realtime-mini

$0.60 / 1M input tokens

$0.06 / 1M cached input tokens

$2.40 / 1M output tokens

Sora Video API

Richly detailed, dynamic video generation and remixing with our latest generative model.

Models	Size	Price per second
sora-2	Portrait: 720 x 1280 Landscape: 1280 x 720	$0.10
sora-2-pro	Portrait: 720 x 1280 Landscape: 1280 x 720	$0.30
sora-2-pro	Portrait: 1024 x 1792 Landscape: 1792 x 1024	$0.50

Image Generation API

Precise, high-fidelity image generation and editing with our latest multimodal model.

GPT-image-1

$5.00 / 1M input tokens

$1.25 / 1M cached input tokens*

GPT-image-1-mini

$2.00 / 1M input tokens

$0.20 / 1M cached input tokens*

Prompts are billed similarly to other GPT models. Image outputs cost approximately $0.01 (low), $0.04 (medium), and $0.17 (high) for square images.

*available via the Responses API

For detailed token usage by image quality and size, see the docs.

Responses API

Our newest API combining the simplicity of Chat Completions with the built-in tool use of Assistants.

Price

Responses API is not priced separately. Tokens are billed at the chosen language model’s input and output rates.

Chat Completions API

Build text-based conversational experiences.

Price

Chat Completions API is not priced separately. Tokens are billed at the chosen language model's input and output rates.

Assistants API

Build assistant-like experiences with our tools.

Price

Assistants API is not priced separately. Tokens are billed at the chosen language model's input and output rates.

Built-in tools

Extend model capabilities with built-in tools in the API Platform.

Code Interpreter

$0.03 / session

File Search Storage

$0.10 / GB of vector storage per day (first GB free)

File Search Tool Call (Responses API only)

$2.50 / 1k tool calls

Web Search Tool Call

There are two components that contribute to the cost of using the web search tool: (1) Tool calls and (2) Search content tokens.

Tool calls are billed per 1,000 calls, according to the tool version and model type.
Search content tokens are tokens retrieved from the search index and fed to the model alongside your prompt to generate an answer. These are billed at the model’s input token rate, unless otherwise specified.

Tool Version	Cost
Web search (all models)	$10.00 / 1K calls + search content tokens billed at model rates¹
Web search preview (reasoning models)	$10.00 / 1K calls + search content tokens billed at model rates
Web search preview (non-reasoning models)	$25.00 / 1K calls + search content tokens are free

¹For gpt-4o-mini and gpt-4.1-mini with the web search non-preview tool, search content tokens are charged as a fixed block of 8,000 input tokens per call.

The billing dashboard will report gpt-4.1 and gpt-4.1-mini search line items as ‘web search tool calls | gpt-4o’ and ‘web search tool calls | gpt-4o-mini’

GB refers to binary gigabytes of storage (also known as gibibyte), where 1GB is 2^30 bytes.

AgentKit

Build, deploy, and optimize production-grade agents with Agent Builder, ChatKit, and Evals.

Billing

Begins on November 1, 2025 — no charges will apply before then.

Usage meter

Storage for ChatKit File / Image Uploads.

Free tier (per account, per month)

1 GB

Price beyond free tier

$0.10 / GB-day

What’s always free

Agent Builder – design and iterate with zero cost until you hit Run. Self-hosted ChatKit – host a custom ChatKit backend and pay only normal model-token charges. Enterprise controls – SSO, RBAC, and audit logs are included at no additional fee.

Explore detailed pricing(opens in a new window)

Explore our offerings for Enterprise customers: Priority processing⁠, Scale Tier⁠⁠ and Reserved Capacity⁠⁠.

FAQ

We recommend that developers use our large and mini GPT models for everyday tasks. Our large GPT models generally perform better on a wide range of tasks, while our mini GPT models are fast and inexpensive for simpler tasks.

Our large and mini reasoning models are ideal for complex, multi-step tasks and STEM use cases that require deep thinking about tough problems. You can choose the mini reasoning model if you're looking for a faster, more inexpensive option.

We recommend experimenting with all of these models in the Playground⁠⁠(opens in a new window) to explore which models provide the best price performance trade-off for your usage.

You can set a monthly budget in your billing settings⁠⁠(opens in a new window), after which we’ll stop serving your requests. There may be a delay in enforcing the limit, and you are responsible for any overage incurred. You can also configure an email notification threshold to receive an email alert once you cross that threshold each month. We recommend checking your usage tracking dashboard⁠(opens in a new window) regularly to monitor your spend.

For customers managing work with Projects, you can set and manage billing restrictions per project⁠(opens in a new window)⁠ in the Dashboard.

Images are converted into tokens and charged per token. Text models price image tokens at standard text token rates, while GPT Image and gpt-realtime uses a separate image token rate. Models like gpt-4.1-mini, gpt-4.1-nano, and o4-mini convert images into tokens differently. Learn more in our docs⁠(opens in a new window).

Pricing calculator

Set model

Set width

Set height

Low resolution

=$0.000263

Price per 1M tokens (fixed)	$1.25
512 × 512 tiles	1 × 1
Total tiles	1
Base tokens	70
Tile tokens	140 × 1 = 140
Total tokens	210
Total price	$0.000263

Start creating with OpenAI’s powerful models.

Get started Contact sales

API Pricing

Flagship models

GPT-5.2

Price

GPT-5.2 pro

Price

GPT-5 mini

Price

Fine-tuning our models

GPT-4.1

Fine-tuning price

GPT-4.1 mini

Fine-tuning price

GPT-4.1 nano

Fine-tuning price

o4-mini

Reinforcement fine-tuning price

Our APIs

Realtime API

Text

Audio

Image

Sora Video API

Image Generation API

Text

Image

Responses API

Chat Completions API

Assistants API

Built-in tools

AgentKit

FAQ

Which model should I use?

Do you offer an enterprise package or SLAs?

Will I be charged for API usage in the Playground?

How will I know how many tokens I’ve used each month?

How can I manage my spending on the API platform?

Is access to the API included in ChatGPT Plus, Business, Enterprise or Edu?

How is pricing calculated for images?

Start creating with OpenAI’s powerful models.