"Inference Engineering" is now available. Get your copy here
changelog / post

Terminate deployment replica via API

Go back

You can now terminate a specific replica within a deployment using the new management API endpoint. This lets you remove individual replicas without affecting the rest of the deployment, making it useful for evicting stuck or unhealthy replicas.

curl --request DELETE \
--url https://api.baseten.co/v1/models/{model_id}/deployments/{deployment_id}/replicas/{replica_id} \
--header "Authorization: Api-Key $BASETEN_API_KEY"

For more information, see the terminate deployment replica docs.