Hot reload for development deployments

Mar 27, 2026

Go back

truss watch and truss push --watch now support hot-reloading model code changes with the --hot-reload and --watch-hot-reload flags. Instead of restarting the inference server, hot reload swaps your model class in-process: keeping weights and caches loaded for near-instant iteration on predict() logic.

For more information, see Deploy and iterate.

Explore Baseten today

Start deploying Talk to an engineer