Overview

The FineVoice API gives you programmatic access to a full suite of AI audio capabilities, from text-to-speech and voice conversion to music generation and audio enhancement. All endpoints are served over HTTPS and return JSON responses.

Base URL
https://apis.finevoice.ai

Authentication

Every request must include a valid API key as a Bearer token in the Authorization header.
Authorization: Bearer YOUR_API_KEY
Generate or manage your API keys at finevoice.ai/usercenter.
Keep your API key private. Do not expose it in client-side code or public repositories.
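One way to keep the key out of source code is to read it from the environment when the header is built. The following is a minimal Python sketch, not an official client: the helper name `auth_headers` and the environment variable `FINEVOICE_API_KEY` are assumptions chosen for illustration.

```python
import os


def auth_headers(api_key=None):
    """Build the Authorization header for a FineVoice request.

    FINEVOICE_API_KEY is a hypothetical environment variable name,
    not something the FineVoice documentation defines.
    """
    key = api_key or os.environ.get("FINEVOICE_API_KEY")
    if not key:
        raise ValueError("No API key supplied and FINEVOICE_API_KEY is not set")
    # The key is sent as a Bearer token, as described above.
    return {"Authorization": f"Bearer {key}"}
```

Passing the key explicitly takes precedence over the environment, which is convenient in tests.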

API Categories

| Category | Description |
| --- | --- |
| Text to Speech | Convert text into natural-sounding speech with emotion tag support |
| Voice Conversion | Transform the voice in an audio file to a different AI voice |
| Speech to Text | Transcribe audio or video with speaker diarization and timestamps |
| Sound Effect Generation | Generate royalty-free sound effects from a prompt or video |
| Audio Separation | Separate vocals, instruments, and stems from audio |
| Podcast Generation | Generate multi-speaker AI podcasts from a prompt or script |
| Task Status | Poll the result of any asynchronous task |
| AI Voices | Browse and search available voice models |
| Voice Design | Design a new AI voice from a text description |
| AI Voice Cloning | Train a custom voice model from a reference audio |
| Music Generation | Generate full songs with vocals and instrumentation |
| Background Music | Generate background music tracks from a prompt |
| Music Cover | Replace vocals in a song with an AI voice |
| Audio Enhancement | Denoise, normalize, remove silences, and more |

Synchronous vs Asynchronous

Some endpoints support an optional useAsync field. When set to true, the API queues the job and immediately returns a taskId. Poll the task status endpoint until the result is ready.
{ "useAsync": true }
Polling task status
GET /v1/task/{task_id}
A completed task returns a url (or urls for multi-output tasks) containing the download link.
{
  "status": 200,
  "taskId": "a1b2c3d4-e5f6-7890-abcd-ef1234567890",
  "url": "https://cdn.finevoice.ai/output/result.mp3"
}
Status codes returned in the response body:
| status | Meaning |
| --- | --- |
| 200 | Completed: result is ready |
| 202 | In progress: poll again |
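The polling flow above can be sketched in Python using only the standard library. This is an illustrative sketch under stated assumptions: the function names, the 2-second interval, and the attempt limit are not specified by the FineVoice documentation, and any body status other than 200 or 202 is treated as an error because its meaning is not documented here.

```python
import json
import time
import urllib.request

BASE_URL = "https://apis.finevoice.ai"


def fetch_task(task_id, api_key):
    """GET /v1/task/{task_id} and return the decoded JSON body."""
    req = urllib.request.Request(
        f"{BASE_URL}/v1/task/{task_id}",
        headers={"Authorization": f"Bearer {api_key}"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


def wait_for_task(task_id, api_key, fetch=fetch_task, interval=2.0, max_attempts=30):
    """Poll until the response body reports status 200, then return the body.

    interval and max_attempts are assumed defaults, not documented values.
    """
    for _ in range(max_attempts):
        body = fetch(task_id, api_key)
        status = body.get("status")
        if status == 200:
            return body  # contains "url", or "urls" for multi-output tasks
        if status != 202:
            raise RuntimeError(f"Unexpected task status: {status!r}")
        time.sleep(interval)  # 202: still in progress, wait and poll again
    raise TimeoutError(f"Task {task_id} did not complete after {max_attempts} polls")
```

The `fetch` parameter exists so the loop can be exercised with a stub instead of a live request.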

Request Format

All POST endpoints accept Content-Type: application/json.
POST /v1/audio/speech-synthesis
Authorization: Bearer YOUR_API_KEY
Content-Type: application/json

{
  "voice": "james",
  "text": "[happy] Hello! Welcome to FineVoice."
}
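The same request can be assembled in Python with the standard library. This is a minimal sketch, assuming only the endpoint, headers, and body fields shown above; the helper name `build_speech_request` is hypothetical.

```python
import json
import urllib.request

BASE_URL = "https://apis.finevoice.ai"


def build_speech_request(api_key, voice, text):
    """Prepare (but do not send) a POST to /v1/audio/speech-synthesis."""
    payload = json.dumps({"voice": voice, "text": text}).encode("utf-8")
    return urllib.request.Request(
        f"{BASE_URL}/v1/audio/speech-synthesis",
        data=payload,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
    )
```

To send it, pass the returned object to `urllib.request.urlopen` and decode the JSON body of the response, for example `json.load(urllib.request.urlopen(req))`.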

Error Responses

| HTTP Status | Meaning |
| --- | --- |
| 401 | Unauthorized: missing or invalid API key |
| 422 | Validation error: check your request body fields |
| 404 | Not found: resource or task does not exist |
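A client might translate these statuses into readable errors along the following lines. This is a sketch only: the names `ERROR_MEANINGS`, `describe_error`, and `send` are hypothetical, and the structure of the error response body is not described in this overview, so the code relies on the HTTP status alone.

```python
import urllib.error
import urllib.request

# Meanings taken from the table above.
ERROR_MEANINGS = {
    401: "Unauthorized: missing or invalid API key",
    404: "Not found: resource or task does not exist",
    422: "Validation error: check your request body fields",
}


def describe_error(http_status):
    """Return the documented meaning, or a generic message for other statuses."""
    return ERROR_MEANINGS.get(http_status, f"Unexpected HTTP status {http_status}")


def send(req):
    """Send a prepared urllib request and return the raw response bytes.

    HTTP error statuses are re-raised as RuntimeError with a readable message.
    """
    try:
        with urllib.request.urlopen(req) as resp:
            return resp.read()
    except urllib.error.HTTPError as err:
        raise RuntimeError(describe_error(err.code)) from err
```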

Quick Start

See the Quick Start guide to generate your first audio output in under 5 minutes.