Pricing

Rent the GPU you need, by the hour.

Transparent per-hour GPU pricing. Your OpenAI-compatible endpoint runs on the GPU you rent — no per-token fees, no surprises.

GPU pricing

Per-hour rates by card

GPU             VRAM     vCPU      RAM      Price

More than 80 GB VRAM
B200            180 GB   28 vCPU   283 GB   $5.89 /hr
H200            141 GB   24 vCPU   276 GB   $4.39 /hr
H100 NVL        94 GB    16 vCPU   94 GB    $3.19 /hr
RTX Pro 6000    96 GB    16 vCPU   188 GB   $2.09 /hr

80 GB VRAM
H100 SXM        80 GB    20 vCPU   125 GB   $3.29 /hr
H100 PCIe       80 GB    16 vCPU   188 GB   $2.89 /hr
A100 SXM        80 GB    16 vCPU   125 GB   $1.49 /hr
A100 PCIe       80 GB    8 vCPU    117 GB   $1.39 /hr

48 GB VRAM
L40             48 GB    8 vCPU    94 GB    $0.99 /hr
L40S            48 GB    16 vCPU   94 GB    $0.86 /hr
RTX 6000 Ada    48 GB    10 vCPU   167 GB   $0.77 /hr
RTX A6000       48 GB    9 vCPU    50 GB    $0.49 /hr
A40             48 GB    9 vCPU    50 GB    $0.44 /hr

32 GB VRAM
RTX 5090        32 GB    9 vCPU    35 GB    $0.99 /hr

24 GB VRAM
RTX 4090        24 GB    6 vCPU    41 GB    $0.69 /hr
RTX 3090        24 GB    16 vCPU   125 GB   $0.46 /hr
L4              24 GB    12 vCPU   50 GB    $0.39 /hr
RTX A5000       24 GB    9 vCPU    25 GB    $0.27 /hr

Indicative pricing — final rates vary by region & availability.

How billing works

Simple, usage-based, no per-token math

Per-hour GPU rental

Billed by the second after the first minute, only while your deployment is running.

No per-token fees

You rent the GPU; the inference throughput it produces is entirely yours.

Bring your own model

Same GPU pricing whether you use our catalog, your weights, or your container.
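
For a rough estimate of what one run costs, the per-hour rate can be prorated by the second. The sketch below is illustrative only: the function name `deployment_cost` is hypothetical, and it assumes "after the first minute" means a one-minute minimum charge followed by per-second metering, which may not match the actual billing rules.

```python
import math


def deployment_cost(rate_per_hour: float, runtime_seconds: float,
                    minimum_seconds: int = 60) -> float:
    """Estimate the cost in dollars of one deployment run.

    Assumption (not confirmed by the pricing page): the first minute is a
    minimum charge, and runtime beyond it is metered in whole seconds.
    """
    if runtime_seconds <= 0:
        return 0.0
    # Round partial seconds up, then apply the assumed one-minute minimum.
    billable_seconds = max(math.ceil(runtime_seconds), minimum_seconds)
    return rate_per_hour * billable_seconds / 3600


if __name__ == "__main__":
    # H100 SXM at the indicative $3.29 /hr, running for 45 minutes.
    print(f"${deployment_cost(3.29, 45 * 60):.4f}")
```

Because the rate does not depend on what you run, the same estimate applies to catalog models, your own weights, and custom containers.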

Storage

Persistent storage, billed monthly

Type              Price
Volume disk       $0.10 / GB / mo
Network storage   $0.07 / GB / mo

Indicative pricing — final rates vary by region & availability.
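
A monthly storage bill can be estimated from the two rates above. This is a minimal sketch: the function name is hypothetical, the rates are the indicative ones listed here, and it assumes a full month of storage with no proration.

```python
# Indicative rates from the storage table, in dollars per GB per month.
VOLUME_DISK_RATE = 0.10
NETWORK_STORAGE_RATE = 0.07


def monthly_storage_cost(volume_gb: float, network_gb: float) -> float:
    """Estimate the monthly storage cost in dollars.

    Assumes storage is held for a full month; partial-month proration
    is not modeled because the pricing page does not describe it.
    """
    if volume_gb < 0 or network_gb < 0:
        raise ValueError("storage sizes must be non-negative")
    return volume_gb * VOLUME_DISK_RATE + network_gb * NETWORK_STORAGE_RATE


if __name__ == "__main__":
    # 100 GB of volume disk plus 500 GB of network storage.
    print(f"${monthly_storage_cost(100, 500):.2f} per month")
```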

FAQ

Common questions

How is GPU time billed?

At the per-hour rate, metered by the second after the first minute, and only while a deployment is running. Stop it and billing stops, so you pay only for the time your GPU is up.

Do you charge per token or per request?

No. You rent the GPU, and the OpenAI-compatible endpoint that runs on it is included. There are no per-token or per-request fees.
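
To compare hourly rental with per-token pricing elsewhere, you can convert an hourly rate into an effective cost per million tokens. The sketch below is only arithmetic: the function name is hypothetical, and the throughput figure is an assumption you would need to measure for your own model and GPU.

```python
def cost_per_million_tokens(rate_per_hour: float,
                            tokens_per_second: float) -> float:
    """Effective dollars per one million tokens at a sustained throughput.

    tokens_per_second is a measured or assumed figure; it depends on the
    model, batch size, and GPU, and is not published on the pricing page.
    """
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    tokens_per_hour = tokens_per_second * 3600
    return rate_per_hour / tokens_per_hour * 1_000_000


if __name__ == "__main__":
    # A100 SXM at the indicative $1.49 /hr, assuming 1,000 tokens/s sustained.
    print(f"${cost_per_million_tokens(1.49, 1000):.4f} per 1M tokens")
```

The effective figure falls as utilization rises, since the hourly rate stays fixed however many tokens the GPU produces.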

Can I bring my own model?

Yes — push your own weights or a custom container. It runs on the same per-hour GPU pricing as the catalog.

Is there a minimum commitment?

No. Pricing is on-demand and pay as you go. Reserved-capacity discounts are coming later.

More questions? Get started and reach us from the dashboard.

Ready to deploy?

Launch an open-source model on Indonesian GPU infrastructure and get a live, OpenAI-compatible endpoint in under 5 minutes.