"Inference Engineering" is now available. Get your copy here

changelog / post

Structured output and function calling support

Sep 12, 2024

Go back

Models deployed with the TensorRT-LLM Engine Builder now support function calling (aka tool use) and structured output (aka JSON mode). Learn more:

Explore Baseten today

Start deploying Talk to an engineer