Product
Product
Platform
Platform
Developer
Developer
Resources
Resources
Research
Research
Customers
Customers
Pricing
Pricing
Log in
Get started
Model Performance Team
Model performance
Boosting MTP acceptance in TensorRT-LLM: +40% throughput
Mahmoud Hassan
1 other
Explore Baseten today
Start deploying
Talk to an engineer