"Inference Engineering" is now available. Get your copy here
Resources

Learn, Build, Deploy

Past events

1...678
Aabhas Sharma logo

The reason why we came to Baseten in the first place was the latency requirements. With our bursty workloads, we got queued for our requests similar to any other user of AI. And our customers don't care about who's queuing you.

Aabhas Sharma
CTO
AI engineering
Alex Ker
1 other
Three things you can do right now to optimize your harness
Model performance
Model Performance Team
eagle 3
Infrastructure
Gregory Kofman
2 others
How the Baseten Delivery Network (BDN) makes cold starts fast