Explore

Featured models

google/gemini-3-flash

Google's most intelligent model built for speed with frontier intelligence, superior search, and grounding

80.8K runs

Official

qwen/qwen3-tts

A unified Text-to-Speech demo featuring three powerful modes: Voice, Clone and Design

7.3K runs

Official

black-forest-labs/flux-2-klein-4b

Very fast image generation and editing model. 4 steps distilled, sub-second inference for production and near real-time applications.

1.4M runs

Official

bytedance/seedance-1.5-pro

A joint audio-video model that accurately follows complex instructions.

232.2K runs

Official

qwen/qwen-image-edit-2511

An enhanced version over Qwen-Image-Edit-2509, featuring multiple improvements including notably better consistency

483.4K runs

Official

openai/gpt-image-1.5

OpenAI's latest image generation model with better instruction following and adherence to prompts

1.8M runs

Official

philz1337x/crystal-video-upscaler

High-precision video upscaler optimized for portraits, faces and products. One of the upscale modes powered by Clarity AI. X:https://x.com/philz1337x

831 runs

Official

openai/gpt-5.2

The best model for coding and agentic tasks across industries

179.7K runs

Official

bytedance/seedream-4.5

Seedream 4.5: Upgraded Bytedance image model with stronger spatial understanding and world knowledge

2.3M runs

Official

prunaai/z-image-turbo

Z-Image Turbo is a super fast text-to-image model of 6B parameters developed by Tongyi-MAI.

15.2M runs

Official

google/nano-banana-pro

Google's state of the art image generation and editing model 🍌🍌

10.9M runs

Official

google/veo-3.1

New and improved version of Veo 3, with higher-fidelity video, context-aware audio, reference image and last frame support

314K runs

Official

Official models

Official models are always on, maintained, and have predictable pricing.

View all official models

black-forest-labs / flux-2-klein-9b-base-lora

A version of FLUX.2 [klein] 9B-base that supports fast fine-tuned lora inference

32 runs

Official

black-forest-labs / flux-2-klein-4b-base-lora

A version of FLUX.2 [klein] 4B-base that supports fast fine-tuned lora inference

26 runs

Official

lightricks / audio-to-video

Use audio input with an image or prompt to generate videos

101 runs

Official

moonshotai / kimi-k2.5

Moonshot AI's latest open model. It unifies vision and text, thinking and non-thinking modes, and single-agent and multi-agent execution into one model

171 runs

Official

google / gemini-3-flash

Google's most intelligent model built for speed with frontier intelligence, superior search, and grounding

80.8K runs

Official

qwen / qwen3-tts

A unified Text-to-Speech demo featuring three powerful modes: Voice, Clone and Design

7.3K runs

Official

black-forest-labs / flux-2-klein-9b

4 step distilled version of FLUX.2 [klein]. A foundation model for maximum flexibility and control

10.6K runs

Official

black-forest-labs / flux-2-klein-9b-base

Un-distilled version of FLUX.2 [klein]. A foundation model for maximum flexibility and control

13.3K runs

Official

black-forest-labs / flux-2-klein-4b-base

Un-distilled version of FLUX.2 [klein]. Optimized for fine-tuning, customization, and post-training workflows

5.9K runs

Official

black-forest-labs / flux-2-klein-4b

Very fast image generation and editing model. 4 steps distilled, sub-second inference for production and near real-time applications.

1.4M runs

Official

sourceful / riverflow-v2-max-preview

Most powerful iteration of Riverflow model from Sourceful, ideal for brand asset generation

209 runs

Official

sourceful / riverflow-v2-standard-preview

Main version of Riverflow Image Model from Sourceful, ideal for brand design

98 runs

Official

sourceful / riverflow-v2-fast-preview

Fast version of Sourceful Riverflow image generation model, ideal for brand assets

152 runs

Official

lightricks / ltx-2-distilled

The first open source audio-video model

4.7K runs

Official

kwaivgi / kling-v2.6

Kling 2.6 Pro: Top-tier image-to-video with cinematic visuals, fluid motion, and native audio generation

61K runs

Official

qwen / qwen-image-2512

Qwen Image 2512 is an improved version of Qwen Image with more realistic human generation, finer textures, and stronger text rendering

26.9K runs

Official

bytedance / seedance-1.5-pro

A joint audio-video model that accurately follows complex instructions.

232.2K runs

Official

kwaivgi / kling-v2.6-motion-control

Enables precise control of character actions and expressions from a reference image.

91.1K runs

Official

qwen / qwen-image-edit-2511

An enhanced version over Qwen-Image-Edit-2509, featuring multiple improvements including notably better consistency

483.4K runs

Official

openai / gpt-image-1.5

OpenAI's latest image generation model with better instruction following and adherence to prompts

1.8M runs

Official

I want to…

View all collections

Generate images

Use AI to generate images & photos with an API

Caption videos

Use AI to caption videos with an API

Generate speech

Use AI for text-to-speech or to clone your voice via API

Generate images from a face

Use AI to generate images from a face with an API

Generate videos

Use AI to generate videos with an API

Upscale images with super resolution

Use AI to upscale images with super resolution with an API

Generate music

Use AI to generate music with an API

Edit any image

Use AI to edit any image via API

Transcribe speech to text

Use AI to transcribe speech to text via API

OCR to extract text from images

Use AI For Optical Character Recognition (OCR) to extract text from images via API

Remove backgrounds

Use AI to remove backgrounds from images and videos with an API

FLUX family of models

FLUX AI models: advanced image generation & editing via API

Restore images

Use AI to restore images via API

Enhance videos

Use AI to enhance videos via API - Replicate

Detect NSFW content

Detect NSFW content in images and text

Classify text

Classify text by sentiment, topic, intent, or safety

Speaker diarization

Identify speakers from audio and video inputs

Create realistic face swaps

Replace faces across images with natural-looking results.

Turn sketches into images

Transform rough sketches into polished visuals

Generate emojis

Generate custom emojis from text or images

Generate anime-style images and videos

Create anime-style characters, scenes, and animations

Large Language Models (LLMs)

Explore Large Language Models (LLMs) for chat, generation & NLP tasks via API

Generate videos from images

Use AI to Generate Videos from Images with API

Lipsync videos

Use AI to generate lipsync videos with an API

Create 3D content

Use AI to create 3D content with an API

Vision models

Chat with images for understanding, captioning & detection via API

Try AI models for free

Try AI Models for free: video generation, image generation, upscaling, and photo restoration

Control image generation

Use AI to control image generation with an API

Embedding models

Embedding models for AI search and analysis

Edit your videos

Use AI to edit your videos with an API

Object detection and segmentation

Use AI object detection and segmentation models to distinguish objects in images & videos

Official AI models

Official AI models: Always available, stable, and predictably priced

Flux fine-tunes

Flux fine-tunes: build and run custom AI image models via API

Kontext fine-tunes

Kontext fine-tunes: Build custom AI image models with an API

Create songs with voice cloning

Create songs with voice cloning models via API

Media utilities

AI media utilities: auto-caption, watermark, frame extraction & more via API

Qwen-Image fine-tunes

Browse the diverse range of qwen-image fine-tunes the community has custom-trained on Replicate.

WAN family of models

WAN family of models: powerful image-to-video & text-to-video models

Caption Images

Use AI To Caption Images with an API

Latest models

vandalwraith / aunt01

15 runs

black-forest-labs / flux-2-klein-9b-base-lora

A version of FLUX.2 [klein] 9B-base that supports fast fine-tuned lora inference

32 runs

Official

black-forest-labs / flux-2-klein-4b-base-lora

A version of FLUX.2 [klein] 4B-base that supports fast fine-tuned lora inference

26 runs

Official

lightricks / audio-to-video

Use audio input with an image or prompt to generate videos

101 runs

Official

moonshotai / kimi-k2.5

Moonshot AI's latest open model. It unifies vision and text, thinking and non-thinking modes, and single-agent and multi-agent execution into one model

171 runs

Official