π₯ Next-Generation LLM Gateway and AI Asset Management System
δΈζ | English | FranΓ§ais | ζ₯ζ¬θͺ
Quick Start β’ Key Features β’ Deployment β’ Documentation β’ Help
Note
This is an open-source project developed based on One API
Important
- This project is for personal learning purposes only, with no guarantee of stability or technical support
- Users must comply with OpenAI's Terms of Use and applicable laws and regulations, and must not use it for illegal purposes
- According to the γInterim Measures for the Management of Generative Artificial Intelligence Servicesγ, please do not provide any unregistered generative AI services to the public in China.
No particular order
Thanks to JetBrains for providing free open-source development license for this project
# Clone the project
git clone https://github.com/QuantumNous/new-api.git
cd new-api
# Edit docker-compose.yml configuration
nano docker-compose.yml
# Start the service
docker-compose up -dUsing Docker Commands
# Pull the latest image
docker pull calciumion/new-api:latest
# Using SQLite (default)
docker run --name new-api -d --restart always \
-p 3000:3000 \
-e TZ=Asia/Shanghai \
-v ./data:/data \
calciumion/new-api:latest
# Using MySQL
docker run --name new-api -d --restart always \
-p 3000:3000 \
-e SQL_DSN="root:123456@tcp(localhost:3306)/oneapi" \
-e TZ=Asia/Shanghai \
-v ./data:/data \
calciumion/new-api:latestπ‘ Tip:
-v ./data:/datawill save data in thedatafolder of the current directory, you can also change it to an absolute path like-v /your/custom/path:/data
π After deployment is complete, visit http://localhost:3000 to start using!
π For more deployment methods, please refer to Deployment Guide
π Official Documentation | 
Quick Navigation:
| Category | Link |
|---|---|
| π Deployment Guide | Installation Documentation |
| βοΈ Environment Configuration | Environment Variables |
| π‘ API Documentation | API Documentation |
| β FAQ | FAQ |
| π¬ Community Interaction | Communication Channels |
For detailed features, please refer to Features Introduction
| Feature | Description |
|---|---|
| π¨ New UI | Modern user interface design |
| π Multi-language | Supports Chinese, English, French, Japanese |
| π Data Compatibility | Fully compatible with the original One API database |
| π Data Dashboard | Visual console and statistical analysis |
| π Permission Management | Token grouping, model restrictions, user management |
- β Online recharge (EPay, Stripe)
- β Pay-per-use model pricing
- β Cache billing support (OpenAI, Azure, DeepSeek, Claude, Qwen and all supported models)
- β Flexible billing policy configuration
- π Discord authorization login
- π€ LinuxDO authorization login
- π± Telegram authorization login
- π OIDC unified authentication
- π Key quota query usage (with neko-api-key-tool)
API Format Support:
- β‘ OpenAI Responses
- β‘ OpenAI Realtime API (including Azure)
- β‘ Claude Messages
- β‘ Google Gemini
- π Rerank Models (Cohere, Jina)
Intelligent Routing:
- βοΈ Channel weighted random
- π Automatic retry on failure
- π¦ User-level model rate limiting
Format Conversion:
- π OpenAI Compatible β Claude Messages
- π OpenAI Compatible β Google Gemini
- π Google Gemini β OpenAI Compatible - Text only, function calling not supported yet
- π§ OpenAI Compatible β OpenAI Responses - In development
- π Thinking-to-content functionality
Reasoning Effort Support:
View detailed configuration
OpenAI series models:
o3-mini-high- High reasoning efforto3-mini-medium- Medium reasoning efforto3-mini-low- Low reasoning effortgpt-5-high- High reasoning effortgpt-5-medium- Medium reasoning effortgpt-5-low- Low reasoning effort
Claude thinking models:
claude-3-7-sonnet-20250219-thinking- Enable thinking mode
Google Gemini series models:
gemini-2.5-flash-thinking- Enable thinking modegemini-2.5-flash-nothinking- Disable thinking modegemini-2.5-pro-thinking- Enable thinking modegemini-2.5-pro-thinking-128- Enable thinking mode with thinking budget of 128 tokens- You can also append
-low,-medium, or-highto any Gemini model name to request the corresponding reasoning effort (no extra thinking-budget suffix needed).
For details, please refer to API Documentation - Relay Interface
| Model Type | Description | Documentation |
|---|---|---|
| π€ OpenAI-Compatible | OpenAI compatible models | Documentation |
| π€ OpenAI Responses | OpenAI Responses format | Documentation |
| π¨ Midjourney-Proxy | Midjourney-Proxy(Plus) | Documentation |
| π΅ Suno-API | Suno API | Documentation |
| π Rerank | Cohere, Jina | Documentation |
| π¬ Claude | Messages format | Documentation |
| π Gemini | Google Gemini format | Documentation |
| π§ Dify | ChatFlow mode | - |
| π― Custom | Supports complete call address | - |
View complete interface list
Tip
Latest Docker image: calciumion/new-api:latest
| Component | Requirement |
|---|---|
| Local database | SQLite (Docker must mount /data directory) |
| Remote database | MySQL β₯ 5.7.8 or PostgreSQL β₯ 9.6 |
| Container engine | Docker / Docker Compose |
Common environment variable configuration
| Variable Name | Description | Default Value |
|---|---|---|
SESSION_SECRET |
Session secret (required for multi-machine deployment) | - |
CRYPTO_SECRET |
Encryption secret (required for Redis) | - |
SQL_DSN |
Database connection string | - |
REDIS_CONN_STRING |
Redis connection string | - |
STREAMING_TIMEOUT |
Streaming timeout (seconds) | 300 |
STREAM_SCANNER_MAX_BUFFER_MB |
Max per-line buffer (MB) for the stream scanner; increase when upstream sends huge image/base64 payloads | 64 |
MAX_REQUEST_BODY_MB |
Max request body size (MB, counted after decompression; prevents huge requests/zip bombs from exhausting memory). Exceeding it returns 413 |
32 |
AZURE_DEFAULT_API_VERSION |
Azure API version | 2025-04-01-preview |
ERROR_LOG_ENABLED |
Error log switch | false |
PYROSCOPE_URL |
Pyroscope server address | - |
PYROSCOPE_APP_NAME |
Pyroscope application name | new-api |
PYROSCOPE_BASIC_AUTH_USER |
Pyroscope basic auth user | - |
PYROSCOPE_BASIC_AUTH_PASSWORD |
Pyroscope basic auth password | - |
PYROSCOPE_MUTEX_RATE |
Pyroscope mutex sampling rate | 5 |
PYROSCOPE_BLOCK_RATE |
Pyroscope block sampling rate | 5 |
HOSTNAME |
Hostname tag for Pyroscope | new-api |
π Complete configuration: Environment Variables Documentation
Method 1: Docker Compose (Recommended)
# Clone the project
git clone https://github.com/QuantumNous/new-api.git
cd new-api
# Edit configuration
nano docker-compose.yml
# Start service
docker-compose up -dMethod 2: Docker Commands
Using SQLite:
docker run --name new-api -d --restart always \
-p 3000:3000 \
-e TZ=Asia/Shanghai \
-v ./data:/data \
calciumion/new-api:latestUsing MySQL:
docker run --name new-api -d --restart always \
-p 3000:3000 \
-e SQL_DSN="root:123456@tcp(localhost:3306)/oneapi" \
-e TZ=Asia/Shanghai \
-v ./data:/data \
calciumion/new-api:latestπ‘ Path explanation:
./data:/data- Relative path, data saved in the data folder of the current directory- You can also use absolute path, e.g.:
/your/custom/path:/data
Method 3: BaoTa Panel
- Install BaoTa Panel (β₯ 9.2.0 version)
- Search for New-API in the application store
- One-click installation
π Tutorial with images
Warning
- Must set
SESSION_SECRET- Otherwise login status inconsistent - Shared Redis must set
CRYPTO_SECRET- Otherwise data cannot be decrypted
Retry configuration: Settings β Operation Settings β General Settings β Failure Retry Count
Cache configuration:
REDIS_CONN_STRING: Redis cache (recommended)MEMORY_CACHE_ENABLED: Memory cache
| Project | Description |
|---|---|
| One API | Original project base |
| Midjourney-Proxy | Midjourney interface support |
| Project | Description |
|---|---|
| neko-api-key-tool | Key quota query tool |
| new-api-horizon | New API high-performance optimized version |
| Resource | Link |
|---|---|
| π FAQ | FAQ |
| π¬ Community Interaction | Communication Channels |
| π Issue Feedback | Issue Feedback |
| π Complete Documentation | Official Documentation |
Welcome all forms of contribution!
- π Report Bugs
- π‘ Propose New Features
- π Improve Documentation
- π§ Submit Code
If this project is helpful to you, welcome to give us a βοΈ StarοΌ
Official Documentation β’ Issue Feedback β’ Latest Release
Built with β€οΈ by QuantumNous
