"Inference Engineering" is now available. Get your copy here
changelog / post

Health check improvements

Go back

Startup probes now handle initialization more reliably by waiting until the model has loaded before executing any liveness checks. The startup phase still defaults to 30 minutes and can be configured up to 50 minutes through the  startup_threshold_seconds parameter.

For more information, see Custom health checks.