NVIDIA H200s are now live on San Francisco Compute!

The San Francisco Compute Company

We run a market for large-scale, vetted GPU clusters. Need 800 H100s on a single InfiniBand fabric for an hour? You got it. 8 H200s dedicated for a year? Sure. Need to cancel? Just sell back what you don't need. You're never locked-in to a long-term contract.

Never pay to move your data. We run the clusters from UEFI on up and support you just like a traditional cloud.

Buy H100s from $0.85/hr

*Prices are from the private beta and may change.

Average prices for

$0.85average gpu/hr

starting

8
16
32
64
128
256
1 hour
1 day
1 week
1 month

Pricing shown per gpu/hour based on selected duration and cluster size. Need a custom configuration? Use our CLI or go directly to the buy page.

A simple but powerful CLI Read docs

~/training
# Buy 256 H100s for 3 days at a max price of $2.30/GPU/hr 
$ sf buy -n 256 -d 3d -p 2.3

# Sell part of a contract for $2.10/GPU/hr 
$ sf sell -c <contract> -s tomorrow -d 1d -p 2.1

Used to train hundreds of large scale models

PrincetonPlay.htPhindliquid
"Very competitive low prices, an abundant amount of H100s for our projects, and great and responsive customer service."
Tianyu Gao, PhD Student profile pictureTianyu Gao, PhD Student –Princeton Language and Intelligence
"We trust SF Compute to be our primary cloud for training and inference workloads. The hardware spec is great and the team is amazing to work with."
Michael Phind profile pictureMichael Royzen, CEO –Phind
"SF Compute has made reserving compute radically easier and simpler for us, with its transparency in pricing and availability of H100s. We are impressed by the SF Compute team's fast and reliable technical support during training."

FAQ

Is it bare metal? Containers? VMs? Kubernetes?

When you buy on the website/CLI, you can choose between Kubernetes clusters and virtual machines. If you'd like a bare-metal cluster, reach out to us via the contact form.

How fast do Kubernetes clusters spin up?

About 0.5 seconds.

How fast do VMs spin up?

About 5 minutes.

Are nodes fully-interconnected with InfiniBand?

Depends. Kubernetes clusters include 3.2Tb/s InfiniBand fabric. VMs do not yet support InfiniBand.

Will the nodes go down?

Yes. Hardware failure rates are much higher on GPU clusters than on web servers. At certain scales, they're guaranteed, so we've designed for failure. We have strict hardware requirements and have seen just about everything that can go wrong. Unlike other providers, we refund for failed nodes and can repack your nodes to ones with healthy hardware.

What support do you provide?

Large clusters get a dedicated Slack channel with our engineering team and emergency phone escalation. Our team has managed $1B+ of hardware and worked on several of the world's supercomputers.

Job openings

No openings at the moment, but check back soon!

Who are we?

We've been featured in Bloomberg, The New York Times, Latent Space Podcast and The Verge. We have a Twitter X account, @sfcompute. You can find us in-person in San Francisco, the great city of progress.

© 2025 San Francisco Compute
PrivacyTerms
made with 💙 in SF