changelog / post
Hot reload for development deployments
truss watch and truss push --watch now support hot-reloading model code changes with the --hot-reload and --watch-hot-reload flags. Instead of restarting the inference server, hot reload swaps your model class in-process: keeping weights and caches loaded for near-instant iteration on predict() logic.
For more information, see Deploy and iterate.