Inference Engineering is now available. Get your copy here
changelog / post

Structured output and function calling support

Go back

Models deployed with the TensorRT-LLM Engine Builder now support function calling (aka tool use) and structured output (aka JSON mode). Learn more: