Rachel Rapp
Product
News
Introducing Baseten Embeddings Inference: The fastest embeddings solution available
Michael Feil
1 other
News
Baseten Chains is now GA for production compound AI systems
Marius Killinger
2 others
News
New observability features: activity logging, LLM metrics, and metrics dashboard customization
Suren Atoyan
4 others
News
Introducing our Speculative Decoding Engine Builder integration for ultra-low-latency LLM inference
Justin Yi
3 others
Model performance
Generally Available: The fastest, most accurate and cost-efficient Whisper transcription
William Gao
3 others
News
Introducing Custom Servers: Deploy production-ready model servers from Docker images
Tianshu Cheng
2 others
News
Create custom environments for deployments on Baseten
Samiksha Pal
3 others
News
Introducing canary deployments on Baseten
Sid Shanker
3 others
News
Baseten partners with Google Cloud to deliver high-performance AI infrastructure to a broader audience
Mike Bilodeau
1 other