H100
80GB vRAM
- HBM3 · 3.35 TB/s
- Flagship throughput
from$2.79/hr
A focused path from an open-source model to a live, billable API — without standing up GPUs, runtimes, or gateways yourself.

Drop-in compatible with the OpenAI SDK. Swap the base URL, keep your code.
curl https://api.nusapod.io/v1/chat/completions \
-H "Authorization: Bearer $NUSAPOD_API_KEY" \
-H "Content-Type: application/json" \
-d '{"model":"qwen2.5-7b-instruct","messages":[{"role":"user","content":"Hello"}]}'From flagship H100s to value-tier 4090s — pick the card that fits your model and your budget, and pay only for the hours you run.
Indicative pricing — final rates vary by region.

Pick a model and it runs on vLLM behind an OpenAI-compatible endpoint in minutes.