San Francisco Compute Inference

To use our OpenAI-compatible inference API, please contact us. We serve these models:


To check the current prices, use the models endpoint.

curl -X GET "https://inference.sfcompute.com/models" -H "Accept: application/json"

To check the current status of a model, use the status endpoint.

curl -X GET "https://inference.sfcompute.com/status" -H "Accept: application/json"
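
When calling these endpoints from a script, it can help to wrap the two requests above in one small shell function. The following is a minimal sketch rather than an official client: the function name `sfc_get` and the 10-second timeout are assumptions made for illustration, and only the base URL, the `models` and `status` paths, and the `Accept` header come from the commands above. The `--fail` flag makes curl exit with a non-zero status on an HTTP error instead of printing the error body.

```shell
# Base URL taken from the curl commands above.
BASE_URL="https://inference.sfcompute.com"

# sfc_get is a hypothetical helper name, not part of the API.
# It sends a GET request to the given endpoint path and prints the
# response body to stdout.
sfc_get() {
  endpoint="$1"
  # --silent hides the progress meter, --show-error keeps error messages,
  # --fail returns a non-zero exit code on HTTP errors,
  # --max-time 10 is an assumed timeout, adjust as needed.
  curl --silent --show-error --fail --max-time 10 \
    -H "Accept: application/json" \
    "${BASE_URL}/${endpoint}"
}
```

With the function defined, `sfc_get models` fetches the current prices and `sfc_get status` fetches the model status. Because the function returns curl's exit code, a caller can write `sfc_get status || echo "request failed"` to react to errors.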