Rolling deployments
You can now shift traffic to a new deployment gradually instead of switching over all at once. Candidate replicas scale up incrementally while replicas of the previous deployment scale down in controlled steps, so updates complete without downtime. You can pause, resume, or cancel a rollout in progress if you spot issues. Control rollout speed with max_surge_percent and stabilization_time_seconds, and use promotion_cleanup_strategy to choose how the previous deployment is handled after promotion.
Enable rolling deployments in your environment's promotion settings in the dashboard, or through the API:
curl -X PATCH "https://api.baseten.co/v1/models/{model_id}/environments/{env_name}" \
  -H "Authorization: Api-Key $BASETEN_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "promotion_settings": {
      "rolling_deploy": true,
      "promotion_cleanup_strategy": "SCALE_TO_ZERO",
      "rolling_deploy_config": {
        "max_surge_percent": 10,
        "stabilization_time_seconds": 300
      }
    }
  }'

For more information, see Rolling deployments.
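If you want to try different rollout speeds, the request above can be wrapped in a small script. The following is a minimal sketch, not an official tool: the build_rolling_payload function and the MODEL_ID, ENV_NAME, and SEND variables are hypothetical names introduced here for illustration. Only the endpoint, headers, and JSON fields come from the example above. By default the script only prints the payload, so you can inspect it before sending anything.

```shell
#!/usr/bin/env bash
# Hypothetical helper: builds the promotion_settings payload from two integers.
# $1 = max_surge_percent, $2 = stabilization_time_seconds
build_rolling_payload() {
  local surge="$1" stabilization="$2"
  printf '{"promotion_settings":{"rolling_deploy":true,"promotion_cleanup_strategy":"SCALE_TO_ZERO","rolling_deploy_config":{"max_surge_percent":%d,"stabilization_time_seconds":%d}}}' \
    "$surge" "$stabilization"
}

# Same values as the example request: 10 percent surge, 300 second stabilization.
payload="$(build_rolling_payload 10 300)"
echo "$payload"

# Dry run by default. Set SEND=1 (an assumption of this sketch) to send the request.
# MODEL_ID and ENV_NAME are placeholders you supply for your own model and environment.
if [ "${SEND:-0}" = "1" ]; then
  curl -X PATCH "https://api.baseten.co/v1/models/${MODEL_ID}/environments/${ENV_NAME}" \
    -H "Authorization: Api-Key $BASETEN_API_KEY" \
    -H "Content-Type: application/json" \
    -d "$payload"
fi
```

Run it once without SEND to check the JSON, then rerun with SEND=1 and your own MODEL_ID and ENV_NAME once the payload looks right.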