Monitor traffic

Build a powerful data flywheel engine for LLMs by creating data-driven feedback loops for your AI applications that compound over time. There are four key components to the platform:
  1. Capture and observe production traffic. Use models from any provider.
  2. Create and run evals against production data to understand your application.
  3. Create datasets and fine-tune models from observed traffic to create better, task-specific models.
  4. Deploy models behind a stable production endpoint to use in your application.
Together, these four steps create a feedback loop that helps you build better, more reliable AI applications on top of models that you have full control over, rather than relying on closed-source models.
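The capture step is typically a drop-in change: keep sending OpenAI-style chat requests, but route them through the platform's endpoint so traffic can be observed. The sketch below builds such a request with the standard library only; the base URL, environment variable name, and model identifier are assumptions for illustration — copy the real values from your dashboard.

```python
import json
import os

# Assumed base URL -- substitute the endpoint from your dashboard.
BASE_URL = "https://api.inference.net/v1"

def build_chat_request(model: str, user_message: str) -> dict:
    """Build an OpenAI-compatible chat-completions request.

    Routing this request through the platform's endpoint (instead of
    directly to the upstream provider) is what captures the traffic.
    """
    return {
        "url": f"{BASE_URL}/chat/completions",
        "headers": {
            # Assumed env var name for your API key.
            "Authorization": f"Bearer {os.environ.get('INFERENCE_API_KEY', '')}",
            "Content-Type": "application/json",
        },
        "body": json.dumps({
            "model": model,
            "messages": [{"role": "user", "content": user_message}],
        }),
    }

# Send the result with any HTTP client (requests, httpx, urllib, ...).
request = build_chat_request("meta-llama/llama-3.1-8b-instruct", "Hello!")
```

Because the request shape is unchanged, existing application code keeps working while every call becomes observable production data for the eval and fine-tuning steps.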

Quickstart

Capture Traffic (Start Here)

Route LLM traffic through Inference.net and observe your first request

API Quickstart

Make your first calls to models deployed on Inference.net
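A first call needs nothing beyond the Python standard library. This sketch prepares (but does not send) a POST request; the endpoint path is an assumption based on OpenAI-compatible conventions — verify it against the API Quickstart before use.

```python
import json
import urllib.request

# Assumed endpoint -- verify against the API Quickstart.
API_URL = "https://api.inference.net/v1/chat/completions"

def first_call(api_key: str, model: str, prompt: str) -> urllib.request.Request:
    """Prepare an OpenAI-compatible chat-completions request."""
    payload = json.dumps({
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }).encode()
    return urllib.request.Request(
        API_URL,
        data=payload,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )

# To actually send it:
#   with urllib.request.urlopen(first_call(key, model, "Hello")) as resp:
#       print(json.load(resp))
```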

Search models

Browse the model catalog before you pick an API or deployment path.

Meet with Us

Meet with our research team to discuss your use case and get help.