changelog / post

New autoscaling setting: Target utilization

Go back

We've introduced a new autoscaling setting: Target utilization. This can be used to configure the amount of headroom you'd like on your model or Chain.

Target utilization can be configured either via the UI or via our REST API. See our docs for more information.

Target utilization uses your desired compute usage level to scale model replicas up or down.Target utilization uses your desired compute usage level to scale model replicas up or down.