GPU clusters

All clusters are vetted and meet minimum technical requirements. Deploy on-demand or reserve instances with no long-term contracts.

  • 24/7 support (Slack, phone, email)

  • 1.5TB+ NVMe storage

  • 1TB+ of RAM

  • No ingress/egress fees

  • 3.2Tb/s InfiniBand (fully interconnected)

  • 100% uptime SLA

from $1.10/gpu/hr

starting

8
16
32
64
128
256
1 hour
1 day
1 week
1 month

Pricing shown per gpu/hour based on selected duration and cluster size. Need a custom configuration? Use our CLI or go directly to the buy page.

Need larger clusters?

Get competitive pricing on on-demand clusters up to 2k GPUs or reserved clusters up to 4k interconnected GPUs.

FAQ

Is it bare metal? Containers? VMs? Kubernetes?

When you buy on the website/CLI, you can choose between Kubernetes clusters and virtual machines. If you'd like a bare-metal cluster, reach out to us via the contact form.

How fast do Kubernetes clusters spin up?

About 0.5 seconds.

How fast do VMs spin up?

About 5 minutes.

Are nodes fully-interconnected with InfiniBand?

Depends. Kubernetes clusters include 3.2Tb/s InfiniBand fabric. VMs do not yet support InfiniBand.

Will the nodes go down?

Yes. Hardware failure rates are much higher on GPU clusters than on web servers. At certain scales, they're guaranteed, so we've designed for failure. We have strict hardware requirements and have seen just about everything that can go wrong. Unlike other providers, we refund for failed nodes and can repack your nodes to ones with healthy hardware.

What support do you provide?

Large clusters get a dedicated Slack channel with our engineering team and emergency phone escalation. Our team has managed $1B+ of hardware and worked on several of the world's supercomputers.

© 2025 San Francisco Compute
PrivacyTerms
made with 💙 in SF