Platform
Platform
Solutions
Solutions
Resources
Resources
Pricing
Pricing
Docs
Docs
Log in
Get started
Matt Howard
Software Engineer
Infrastructure
Control plane vs workload plane in model serving infrastructure
Colin McGrath
2 others
Model performance
Continuous vs dynamic batching for AI inference
Matt Howard
1 other
Infrastructure
Using fractional H100 GPUs for efficient model serving
Matt Howard
3 others
Explore Baseten today
Start deploying
Talk to an engineer