Deploy vLLM models with our OpenAI Bridge
Our OpenAI Bridge is now compatible with vLLM models out of the box! Deploy your vLLM model with Truss and let the docs guide you to an easy integration using the OpenAI completions SDK.
See our latest feature releases, product improvements and bug fixes
Sep 16, 2024
Our OpenAI Bridge is now compatible with vLLM models out of the box! Deploy your vLLM model with Truss and let the docs guide you to an easy integration using the OpenAI completions SDK.
Sep 15, 2024
As of Truss version 0.9.34, you can now promote Chains to a production environment, bringing the same deployment workflow used for Truss Models to Chains. To promote a Chain, simply use the --promote...
Sep 13, 2024
We improved truss watch for more reliable live reloads, so you can test changes on production hardware in seconds without manually pushing via truss push or creating new deployments each time. Adding...
Sep 12, 2024
Models deployed with the TensorRT-LLM Engine Builder now support function calling (aka tool use) and structured output (aka JSON mode). Learn more: Launch announcement blog post Engineering deep dive...
Sep 6, 2024
Stay up to date with all the latest features and updates! New changelog posts will now appear directly in the app. You can view the latest updates through the help menu in the top-right corner. If...
Aug 29, 2024
When using custom build commands in Truss, secrets are often needed; for instance, when installing a pip package from a private GitHub repository. To solve this, you can now use the...
Aug 29, 2024
As a part of using Baseten in CI/CD jobs, we now support a truss login CLI command that allows you to pass an API Key to authenticate. You no longer have to manually edit a ~/.trussrc file. This is...
Aug 29, 2024
Increasingly, we've noticed users interested in deploying models in CI/CD jobs. To make this easier, we now have a way of pushing models to Baseten using a Python SDK. To use it, simply install Truss...
Aug 21, 2024
Deployments that haven't received any traffic and have been scaled to zero for over two weeks will now be automatically deactivated. However, production deployments will remain unaffected by this...
Aug 21, 2024
The model overview page now includes new filtering and sorting options to help you find models more easily. You can filter by running models, scaled-to-zero, inactive, failed, and chains....