Plans and pricing 

Pay for what you use. You pay only for the time your model is actively deploying, scaling up or down, or making predictions. Tune your autoscaling settings to save even more on compute resources. See all available instance types.

Select an instance type
1 T4 GPU, 16 GiB VRAM, 4 vCPUs, 16 GiB RAM
1 L4 GPU, 24 GiB VRAM, 4 vCPUs, 16 GiB RAM
1 A10s GPU, 24 GiB VRAM, 4 vCPUs, 16 GiB RAM
1 A100 GPU, 80 GiB VRAM, 12 vCPUs, 144 GiB RAM
1 H100 GPU, 80 GiB VRAM, 26 vCPUs, 234 GiB RAM
1 vCPU, 2 GiB RAM
1 vCPU, 4 GiB RAM
2 vCPUs, 8 GiB RAM
4 vCPUs, 16 GiB RAM
8 vCPUs, 32 GiB RAM
16 vCPUs, 64 GiB RAM

Choose the plan that's right for you


Startup

$0 per month, just pay for compute

Included in Startup:

Unlimited models and versions
All Baseten features enabled
HIPAA and SOC 2 compliance
Up to 5 workspace users

Pro

Get a custom quote

Everything in Startup plus:

Discounted model resources
Data privacy agreements
Dedicated engineering support
Unlimited workspace users

Self Hosted

Get a custom quote

Everything in Pro plus:

Self-hosted models on your cloud
Multi-stage proof of concept
Live engineering support

Commonly asked questions

  • Baseten is the simplest way to put a model behind an API or webapp hosted on fully managed, scalable infrastructure.
  • You control which GPUs your models use. We currently offer NVIDIA T4, A10, V100, and A100 GPUs. Contact us to learn more or to request additional GPU types.
  • Our servers are located on the U.S. West Coast in AWS data centers. More regions are being added to reduce global latency.
  • We bill by the minute for the time your model is active. You control when each model is active, which instance type it uses, and its autoscaling settings. After you use up your free credits, you’ll be asked to add a credit card to your account. At the end of each month, we’ll charge the card on file for your total usage throughout that month.
  • Yes. We offer on-premises deployments on our Enterprise plan. Contact us to learn more.
  • Data and workloads are hosted in AWS. All user workloads run in isolated environments, with isolation at both the hardware and network levels.
  • Yes, we offer significant volume discounts on model resources. Reach out to us to find out more.
  • Yes, we are happy to support ML efforts for education and non-profit organizations. Contact us to learn more.
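
To make the per-minute billing described above concrete, here is a rough sketch of how a monthly bill could be estimated. The per-minute price is a hypothetical placeholder, not a published Baseten rate, and rounding each active period up to a whole minute is an assumption; the page states only that billing is by the minute.

```python
from math import ceil

# Hypothetical rate in USD per minute; NOT a real Baseten price.
HYPOTHETICAL_PRICE_PER_MINUTE = 0.01


def billed_minutes(active_seconds: float) -> int:
    """Minutes billed for one active period.

    Assumption: partial minutes are rounded up to a whole minute.
    """
    return ceil(active_seconds / 60)


def monthly_cost(active_periods_seconds: list[float], price_per_minute: float) -> float:
    """Estimate the month's charge from a list of active periods (in seconds)."""
    total_minutes = sum(billed_minutes(s) for s in active_periods_seconds)
    return round(total_minutes * price_per_minute, 2)


if __name__ == "__main__":
    # Three active periods: 90 s, 10 min, and 1 h -> 2 + 10 + 60 = 72 billed minutes.
    periods = [90, 600, 3600]
    print(monthly_cost(periods, HYPOTHETICAL_PRICE_PER_MINUTE))  # 0.72
```

Because idle time is not billed in this model, shortening active periods (for example, through autoscaling settings that scale down sooner) directly lowers the estimate.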