Model Serving Pricing
Make live predictions in your apps and websites
GPU Model Serving DBU Rate
Instance Size | GPU Configuration | DBUs / hour |
---|---|---|
Small | T4 or equivalent | 10.48 |
Medium | A10G x 1 GPU or equivalent | 20.00 |
Medium 4X | A10G x 4 GPU or equivalent | 112.00 |
Medium 8X | A10G x 8 GPU or equivalent | 290.80 |
XLarge | A100 40GB x 8 GPU or equivalent | 538.40 |
XLarge | A100 80GB x 8 GPU or equivalent | 628.00 |
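
To turn the DBU rates above into a rough cost estimate, multiply the DBUs per hour by the number of hours and by your price per DBU. The following is a minimal Python sketch of that arithmetic. The price per DBU is not listed in this section (it depends on the selected plan, cloud, and region), so the `0.07` used here is a hypothetical placeholder, and the names `GPU_DBUS_PER_HOUR` and `estimate_cost` are illustrative only.

```python
# DBUs per hour, copied from the GPU Model Serving table above.
# Keys are (instance size, GPU configuration) because "XLarge" appears twice.
GPU_DBUS_PER_HOUR = {
    ("Small", "T4"): 10.48,
    ("Medium", "A10G x 1"): 20.00,
    ("Medium 4X", "A10G x 4"): 112.00,
    ("Medium 8X", "A10G x 8"): 290.80,
    ("XLarge", "A100 40GB x 8"): 538.40,
    ("XLarge", "A100 80GB x 8"): 628.00,
}


def estimate_cost(size: str, gpu: str, hours: float, price_per_dbu: float) -> float:
    """Estimate cost as DBUs/hour * hours * price per DBU.

    price_per_dbu is an assumption supplied by the caller; it is not
    part of the table and varies by plan, cloud, and region.
    """
    dbus_per_hour = GPU_DBUS_PER_HOUR[(size, gpu)]  # KeyError if not in the table
    return dbus_per_hour * hours * price_per_dbu


if __name__ == "__main__":
    # Hypothetical price per DBU, used only for illustration.
    example_price = 0.07
    cost = estimate_cost("Medium", "A10G x 1", hours=10, price_per_dbu=example_price)
    print(f"Estimated cost for 10 hours on Medium: {cost:.2f}")
```

Keying the table by both size and GPU configuration keeps the two XLarge rows distinct. Substitute the actual per-DBU price for your plan and region before relying on the result.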
Pay as you go with a 14-day free trial, or contact us for committed-use discounts or custom requirements.
FAQ
Our regional prices are based on the regional cost of the infrastructure supporting our serverless products.