cloudfit scores and ranks cloud instances across AWS, GCP, and Azure against your workload profile, and stays current as providers deprecate and release new machine types.
Two ways to try cloudfit right now. The one-click Gradio UI takes a workload profile through a form and returns ranked instance recommendations. The FastAPI service serves the same scoring engine over HTTP for programmatic use. Both run on a bundled snapshot of 875 GCP machine types across 5 regions (us-central1, us-east1, us-west1, europe-west4, asia-southeast1) with realistic asymmetric availability. No credentials needed.
Open the UI ↗ Swagger UI ↗ /health ↗
The Space sleeps when idle, so the first request may take 30 to 60 seconds to wake the container. Subsequent requests are instant.
Describe a workload as a `WorkloadProfile` or a YAML file. cloudfit understands five resource archetypes: I/O, CPU, memory, GPU, and burst-parallel. Ranking follows the chosen `optimize_for` mode. No cloud credentials needed.
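As a rough illustration of what a workload profile might carry, here is a minimal sketch. It is not cloudfit's actual `WorkloadProfile`: the class name `ProfileSketch`, the fields `name`, `vcpus`, and `memory_gb`, the enum values, and the choice of `balanced` as the default mode are all assumptions made for this example. Only the five archetypes and the four mode names come from this README.

```python
from dataclasses import dataclass
from enum import Enum


class Archetype(Enum):
    """The five resource archetypes named in this README (string values are assumed)."""
    IO = "io"
    CPU = "cpu"
    MEMORY = "memory"
    GPU = "gpu"
    BURST_PARALLEL = "burst-parallel"


# Mode names taken from the optimize_for table in this README.
OPTIMIZE_MODES = ("cost", "balanced", "performance", "availability")


@dataclass(frozen=True)
class ProfileSketch:
    """Hypothetical stand-in for a workload profile; field names are illustrative."""
    name: str
    archetype: Archetype
    vcpus: int
    memory_gb: float
    optimize_for: str = "balanced"  # assumed default, not confirmed by the source

    def __post_init__(self):
        # Reject unknown modes and non-positive resource requests early.
        if self.optimize_for not in OPTIMIZE_MODES:
            raise ValueError(f"unknown optimize_for mode: {self.optimize_for!r}")
        if self.vcpus <= 0 or self.memory_gb <= 0:
            raise ValueError("vcpus and memory_gb must be positive")


profile = ProfileSketch(name="nightly-etl", archetype=Archetype.IO, vcpus=8, memory_gb=32.0)
```

Check the project's own schema before relying on any of these field names.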
`cost_score` is relative to the candidate set (cheapest qualifying instance = 1.0, dearest = 0.0), so a real price gap moves the score. `archetype` is a classification and disk-sizing label in this release; it does not change ranking, which is driven by `optimize_for`.
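One way to read the relative `cost_score` is as an inverted min-max normalization over the qualifying candidates. The sketch below assumes linear interpolation between the two endpoints, which the README does not state; the function name `cost_scores`, the instance labels, and the prices are made up for illustration.

```python
def cost_scores(hourly_prices):
    """Map each price to [0, 1]: cheapest -> 1.0, most expensive -> 0.0.

    Linear interpolation between the endpoints is an assumption of this sketch.
    """
    if not hourly_prices:
        return {}
    lo = min(hourly_prices.values())
    hi = max(hourly_prices.values())
    if hi == lo:
        # All candidates cost the same, so none is penalized on price.
        return {name: 1.0 for name in hourly_prices}
    return {name: (hi - price) / (hi - lo) for name, price in hourly_prices.items()}


# Hypothetical candidates and hourly prices, not real catalog data.
prices = {"small": 0.10, "medium": 0.20, "large": 0.50}
scores = cost_scores(prices)
```

Because the score depends on the spread between `lo` and `hi`, adding or removing a candidate can shift every other candidate's score.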
| Mode | w_cost | w_perf | w_avail | Best for |
|---|---|---|---|---|
| cost | | | | Batch jobs, dev environments |
| balanced | | | | Default for production workloads |
| performance | | | | Latency-sensitive, GPU inference |
| availability | | | | Long-running, deprecation-sensitive |
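The column names suggest that each mode combines three sub-scores with weights `w_cost`, `w_perf`, and `w_avail`. The numeric weights are not shown in this section, so the sketch below uses placeholder values and assumes a plain weighted sum; `PLACEHOLDER_WEIGHTS`, `rank`, `perf_score`, and `avail_score` are hypothetical names, not cloudfit's API.

```python
# Placeholder weights (w_cost, w_perf, w_avail) per mode.
# These are NOT cloudfit's real values; they only illustrate the mechanism.
PLACEHOLDER_WEIGHTS = {
    "cost": (0.6, 0.2, 0.2),
    "balanced": (0.34, 0.33, 0.33),
    "performance": (0.2, 0.6, 0.2),
    "availability": (0.2, 0.2, 0.6),
}


def rank(candidates, optimize_for):
    """Sort candidates by an assumed weighted sum of three sub-scores, best first."""
    w_cost, w_perf, w_avail = PLACEHOLDER_WEIGHTS[optimize_for]

    def total(candidate):
        return (
            w_cost * candidate["cost_score"]
            + w_perf * candidate["perf_score"]
            + w_avail * candidate["avail_score"]
        )

    return sorted(candidates, key=total, reverse=True)


# Two made-up candidates with opposite strengths.
candidates = [
    {"name": "cheap-slow", "cost_score": 1.0, "perf_score": 0.2, "avail_score": 0.5},
    {"name": "fast-pricey", "cost_score": 0.0, "perf_score": 1.0, "avail_score": 0.5},
]

by_cost = [c["name"] for c in rank(candidates, "cost")]
by_perf = [c["name"] for c in rank(candidates, "performance")]
```

With these placeholder weights, the `cost` mode puts `cheap-slow` first and the `performance` mode puts `fast-pricey` first, which shows how the same candidate set can reorder under a different `optimize_for`.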
The API exposes four endpoints: `/recommend`, `/instances`, `/providers`, and `/diff`. It runs on a multi-region snapshot (5 regions) and is built on FastAPI, with OpenAPI docs included. Self-host with Docker, or use the live demo on Hugging Face Spaces.