
const response = await fetch('https://relay.opengpu.network/v2/ollama/api/chat', {
  method: 'POST',
  headers: {
    'X-API-Key': process.env.RELAY_API_KEY,
    'Content-Type': 'application/json',
  },
  body: JSON.stringify({
    model: 'gpt-oss:20b',
    messages: [
      {
        role: 'user',
        content: 'Break down the pros and cons of decentralized GPU compute.'
      }
    ],
    stream: false,
    think: 'low',
  }),
})
const data = await response.json()
console.info(data.message.content)
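The request above can be wrapped so that a missing key or a failed HTTP call surfaces early. This is a minimal sketch rather than an official client: `buildChatRequest` and `chat` are hypothetical helper names, and the error handling assumes only that a non-2xx status signals failure, since the page does not describe Relay's error format. The URL, headers, and body fields are copied from the example above.

```javascript
// Hypothetical helper: builds the same request as the example above without sending it.
function buildChatRequest(prompt, apiKey, model = 'gpt-oss:20b') {
  if (!apiKey) {
    throw new Error('Missing API key')
  }
  return {
    url: 'https://relay.opengpu.network/v2/ollama/api/chat',
    options: {
      method: 'POST',
      headers: {
        'X-API-Key': apiKey,
        'Content-Type': 'application/json',
      },
      body: JSON.stringify({
        model,
        messages: [{role: 'user', content: prompt}],
        stream: false,
        think: 'low',
      }),
    },
  }
}

// Hypothetical wrapper: sends the request and throws on a non-2xx status.
// Assumption: the response shape matches the example above (data.message.content).
async function chat(prompt, apiKey) {
  const {url, options} = buildChatRequest(prompt, apiKey)
  const response = await fetch(url, options)
  if (!response.ok) {
    throw new Error(`Relay request failed with status ${response.status}`)
  }
  const data = await response.json()
  return data.message.content
}
```

Calling `chat('Summarize this text.', process.env.RELAY_API_KEY)` would then return the reply text, or reject with an error that can be handled in one place.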
Dual Engine Infrastructure
Switch modes anytime, or let Relay decide. Nothing to master or maintain.
Direct Mode
OpenGPU Mode
Available Models
Video Models
Generate high-quality videos from text prompts. From cinematic scenes to creative animations, powered by the latest video generation models.
Image Models
Create stunning images from text descriptions. High-resolution outputs with fine-grained control over style, composition, and detail.
Text Models
Large language models for text generation, summarization, code, and more. Fast inference with high throughput.
AI Service, Your Terms
Publish tasks directly onto the OpenGPU network, or host through our managed layer. Relay adapts to how you build, from experimentation to production scale.
Direct mode
OpenGPU mode
Dashboard view

Launch Your AI Compute Into a New Orbit
Partner with Relay to route workloads smarter, balance cost and latency, and keep your business light-years ahead of the competition.