Changelog

See our latest feature releases, product improvements and bug fixes

Sep 16, 2024

Deploy vLLM models with our OpenAI Bridge

Our OpenAI Bridge is now compatible with vLLM models out of the box! Deploy your vLLM model with Truss and let the docs guide you to an easy integration using the OpenAI completions SDK.

Sep 15, 2024

Promote Chains to production

As of Truss version 0.9.34, you can now promote Chains to a production environment, bringing the same deployment workflow used for Truss Models to Chains. To promote a Chain, simply use the --promote...

Sep 13, 2024

Seamless remote development with Truss watch

We improved truss watch for more reliable live reloads, so you can test changes on production hardware in seconds without manually pushing via truss push or creating new deployments each time. Adding...

Sep 12, 2024

Structured output and function calling support

Models deployed with the TensorRT-LLM Engine Builder now support function calling (aka tool use) and structured output (aka JSON mode). Learn more: Launch announcement blog post Engineering deep dive...

Sep 6, 2024

In-app changelog notifications

Stay up to date with all the latest features and updates! New changelog posts will now appear directly in the app. You can view the latest updates through the help menu in the top-right corner. If...

Aug 29, 2024

Easier secret management for custom build commands

When using custom build commands in Truss, secrets are often needed; for instance, when installing a pip package from a private GitHub repository. To solve this, you can now use the...

Aug 29, 2024

Authenticate from the CLI with Truss login

As a part of using Baseten in CI/CD jobs, we now support a truss login CLI command that allows you to pass an API Key to authenticate. You no longer have to manually edit a ~/.trussrc file. This is...

Aug 29, 2024

Push models to Baseten using the Truss SDK

Increasingly, we've noticed users interested in deploying models in CI/CD jobs. To make this easier, we now have a way of pushing models to Baseten using a Python SDK. To use it, simply install Truss...

Aug 21, 2024

Automatic deactivation for inactive deployments

Deployments that haven't received any traffic and have been scaled to zero for over two weeks will now be automatically deactivated. However, production deployments will remain unaffected by this...

Aug 21, 2024

Filter and sort models on the overview page

The model overview page now includes new filtering and sorting options to help you find models more easily. You can filter by running models, scaled-to-zero, inactive, failed, and chains....