Tri Dao
Model performance
How we run GPT OSS 120B at 500+ tokens per second on NVIDIA GPUs
Amir Haghighat
4 others