"Inference Engineering" is now available. Get your copy here
changelog / post

Hot reload for development deployments

Go back

truss watch and truss push --watch now support hot-reloading model code changes with the --hot-reload and --watch-hot-reload flags. Instead of restarting the inference server, hot reload swaps your model class in-process: keeping weights and caches loaded for near-instant iteration on predict() logic.


For more information, see Deploy and iterate.