changelog / post

New model metrics views

Go back

Get cleaner, more accurate insights into your model’s performance and load with the refreshed model metrics charts in each model’s overview tab..

Monitor requests per minute and both mean and peak response times.

Inference volume chart and response time tabInference volume chart and response time tab

And align cross-reference that demand with essential autoscaling metrics like CPU and GPU usage, plus replica count.

GPU usage chart and GPU memory usage tabGPU usage chart and GPU memory usage tab