Explore

Featured models

inworld/tts-1.5-max

Highest-quality text-to-speech with <200ms latency, emotion control, and 15-language support

22K runs

Official

bytedance/seedream-5-lite

Seedream 5.0 lite: image generation with built-in reasoning, example-based editing, and deep domain knowledge

618.4K runs

Official

runwayml/gen-4.5

State-of-the-art video motion quality, prompt adherence and visual fidelity

69K runs

Official

recraft-ai/recraft-v4

Recraft's latest image generation model, built around design taste. Strong prompt accuracy, art-directed composition, and integrated text rendering. Fast and cost-efficient at standard resolution.

196.8K runs

Official

xai/grok-imagine-video

Generate videos using xAI's Grok Imagine Video model

356.8K runs

Official

moonshotai/kimi-k2.5

Moonshot AI's latest open model. It unifies vision and text, thinking and non-thinking modes, and single-agent and multi-agent execution into one model

28.8K runs

Official

google/gemini-3-flash

Google's most intelligent model built for speed with frontier intelligence, superior search, and grounding

942.8K runs

Official

prunaai/p-video

Fast video generation with built-in draft mode for rapid creative iteration. Text-to-video, image-to-video, and audio-to-video in a single endpoint.

439.7K runs

Official

black-forest-labs/flux-2-klein-4b

Very fast image generation and editing model. 4 steps distilled, sub-second inference for production and near real-time applications.

9.6M runs

Official

openai/gpt-image-1.5

OpenAI's latest image generation model with better instruction following and adherence to prompts

7.3M runs

Official

google/nano-banana-2

Google's fast image generation model with conversational editing, multi-image fusion, and character consistency

4.8M runs

Official

elevenlabs/music

Compose a song from a prompt or a composition plan

32.2K runs

Official

Official models

Official models are always on, maintained, and have predictable pricing.

View all official models

wan-video / wan-2.7-image-pro

Generate and edit high-quality images with Alibaba's Wan 2.7 Pro with 4K output, thinking mode, text-to-image, multi-image editing, and image set generation

2.3K runs

Official

wan-video / wan-2.7-image

Generate and edit images with Alibaba's Wan 2.7

705 runs

Official

google / veo-3.1-lite

Google's cost-efficient video generation model with native audio, optimized for high-volume applications

1.1K runs

Official

wan-video / wan-2.7-videoedit

Edit videos with natural language instructions using Alibaba's Wan 2.7 VideoEdit model

322 runs

Official

wan-video / wan-2.7-r2v

Generate videos from reference images or clips while preserving subject identity using Alibaba's Wan 2.7 reference-to-video model

13 runs

Official

wan-video / wan-2.7-i2v

Generate videos from images, with support for first-and-last-frame control, clip continuation, and audio synchronization using Alibaba's Wan 2.7 model

692 runs

Official

wan-video / wan-2.7-t2v

Generate videos with audio from text prompts using Alibaba's Wan 2.7 model. 1080p, up to 15 seconds, with audio synchronization.

140 runs

Official

xai / grok-imagine-r2v

Generate videos guided by reference images using xAI's Grok Imagine Video model

3.8K runs

Official

xai / grok-imagine-video-extension

Extend videos with xAI's Grok Imagine Video model. Provide a source video and describe what happens next.

878 runs

Official

inworld / tts-1.5-mini

Ultra-fast, cost-efficient text-to-speech with ~120ms latency and 15-language support

5.6K runs

Official

inworld / tts-1.5-max

Highest-quality text-to-speech with <200ms latency, emotion control, and 15-language support

22K runs

Official

prunaai / p-image-upscale

Very efficient image upscaler supporting outputs up to 8 MP. Upscales images to 4 MP in under one second.

1.7K runs

Official

lightricks / ltx-2.3-pro

High-fidelity video generation with portrait support, audio-to-video, retake, and extend. Text, image, and audio-driven creation up to 4K at 50 FPS.

5.5K runs

Official

openai / gpt-5.4

OpenAI's most capable frontier model for complex professional work, coding, and multi-step reasoning.

18K runs

Official

lightricks / ltx-2.3-fast

Lightning-fast video generation with portrait support, camera controls, and synchronized audio. Up to 20 seconds at 1080p, 4K at 50 FPS.

7K runs

Official

kwaivgi / kling-v3-motion-control

Kling 3.0 motion control: transfer motion from a reference video to any character image with improved consistency and quality.

78.6K runs

Official

vidu / q3-turbo

Fast video generation with text-to-video, image-to-video, and start-end-to-video modes. Up to 16 seconds at 1080p with synchronized audio.

441 runs

Official

qwen / qwen-image-2-pro

The pro version of Qwen Image 2 from Alibaba's Qwen team. Enhanced text rendering, realism, and semantic adherence for high-quality image generation and editing.

14.9K runs

Official

qwen / qwen-image-2

A next-generation image generation and editing model from Alibaba's Qwen team. Supports text-to-image and image editing with strong text rendering, especially for Chinese.

8.1K runs

Official

heygen / avatar-iv

Create realistic talking avatar videos from text with HeyGen's Avatar IV engine

284 runs

Official

I want to…

View all collections

Generate images

Use AI to generate images & photos with an API

Caption videos

Use AI to caption videos with an API

Generate speech

Use AI for text-to-speech or to clone your voice via API

Generate images from a face

Use AI to generate images from a face with an API

Generate videos

Use AI to generate videos with an API

Upscale images with super resolution

Use AI to upscale images with super resolution with an API

Generate music

Use AI to generate music with an API

Edit any image

Use AI to edit any image via API

Transcribe speech to text

Use AI to transcribe speech to text via API

OCR to extract text from images

Use AI For Optical Character Recognition (OCR) to extract text from images via API

Remove backgrounds

Use AI to remove backgrounds from images and videos with an API

FLUX family of models

FLUX AI models: advanced image generation & editing via API

Restore images

Use AI to restore images via API

Enhance videos

Use AI to enhance videos via API - Replicate

Detect NSFW content

Detect NSFW content in images and text

Classify text

Classify text by sentiment, topic, intent, or safety

Speaker diarization

Identify speakers from audio and video inputs

Create realistic face swaps

Replace faces across images with natural-looking results.

Turn sketches into images

Transform rough sketches into polished visuals

Generate emojis

Generate custom emojis from text or images

Generate anime-style images and videos

Create anime-style characters, scenes, and animations

Generate videos from images

Use AI to Generate Videos from Images with API

Official models

Official models are always on, predictably priced, and have a stable API.

Large Language Models (LLMs)

Explore Large Language Models (LLMs) for chat, generation & NLP tasks via API

Try AI models for free

Try AI Models for free: video generation, image generation, upscaling, and photo restoration

Lipsync videos

Use AI to generate lipsync videos with an API

Create 3D content

Use AI to create 3D content with an API

Vision models

Chat with images for understanding, captioning & detection via API

Control image generation

Use AI to control image generation with an API

Embedding models

Embedding models for AI search and analysis

Edit your videos

Use AI to edit your videos with an API

Object detection and segmentation

Use AI object detection and segmentation models to distinguish objects in images & videos

Flux fine-tunes

Flux fine-tunes: build and run custom AI image models via API

Kontext fine-tunes

Kontext fine-tunes: Build custom AI image models with an API

Create songs with voice cloning

Create songs with voice cloning models via API

Media utilities

AI media utilities: auto-caption, watermark, frame extraction & more via API

Qwen-Image fine-tunes

Browse the diverse range of qwen-image fine-tunes the community has custom-trained on Replicate.

WAN family of models

WAN family of models: powerful image-to-video & text-to-video models

Caption Images

Use AI To Caption Images with an API

Latest models

wan-video / wan-2.7-image-pro

Generate and edit high-quality images with Alibaba's Wan 2.7 Pro with 4K output, thinking mode, text-to-image, multi-image editing, and image set generation

2.3K runs

Official

wan-video / wan-2.7-image

Generate and edit images with Alibaba's Wan 2.7

705 runs

Official

marestreetmarket / multichannel

64 runs

google / veo-3.1-lite

Google's cost-efficient video generation model with native audio, optimized for high-volume applications

1.1K runs

Official

wan-video / wan-2.7-videoedit

Edit videos with natural language instructions using Alibaba's Wan 2.7 VideoEdit model

322 runs

Official

wan-video / wan-2.7-r2v

Generate videos from reference images or clips while preserving subject identity using Alibaba's Wan 2.7 reference-to-video model

13 runs

Official

wan-video / wan-2.7-i2v

Generate videos from images, with support for first-and-last-frame control, clip continuation, and audio synchronization using Alibaba's Wan 2.7 model

692 runs

Official

wan-video / wan-2.7-t2v

Generate videos with audio from text prompts using Alibaba's Wan 2.7 model. 1080p, up to 15 seconds, with audio synchronization.

140 runs

Official

visionaix / metric3dv2

Metric3D v2 (TPAMI 2024): Monocular metric depth and surface normals from a single image. Predicts real-world depth in meters. Works indoor and outdoor.

16 runs

tomhermans / theretroposter01

31 runs

marestreetmarket / albedo

38 runs

palomamachado-png / palomacalazans

50 runs

Explore

FLUX.2 [pro]

Black Forest Labs' most advanced image generation model yet.

How to prompt Seedream 5.0

Recraft V4: image generation with design taste

Run Isaac 0.1 on Replicate

Featured models

Official models

wan-video / wan-2.7-image-pro

wan-video / wan-2.7-image

google / veo-3.1-lite

wan-video / wan-2.7-videoedit

wan-video / wan-2.7-r2v

wan-video / wan-2.7-i2v

wan-video / wan-2.7-t2v

xai / grok-imagine-r2v

xai / grok-imagine-video-extension

inworld / tts-1.5-mini

inworld / tts-1.5-max

prunaai / p-image-upscale

lightricks / ltx-2.3-pro

openai / gpt-5.4

lightricks / ltx-2.3-fast

kwaivgi / kling-v3-motion-control

vidu / q3-turbo

qwen / qwen-image-2-pro

qwen / qwen-image-2

heygen / avatar-iv

I want to…

Latest models

How to prompt Seedream 5.0

Recraft V4: image generation with design taste

Run Isaac 0.1 on Replicate