Software Engineer
Machine learning infrastructure that just works
Baseten provides all the infrastructure you need to deploy and serve ML models performantly, scalable, and cost-efficiently.
Software Engineer
Stability AI announced the release of Stable Video Diffusion, marking a huge leap forward for open source novel video synthesis
Building on top of open source models gives you access to a wide range of capabilities that you would otherwise lack from a black box endpoint provider.
To attain the full power of a GPU during LLM inference, you have to know if the inference is compute bound or memory bound. Learn how to better utilize GPU resources.
Out of the box, Stable Diffusion XL 1.0 (SDXL) takes 8-10 seconds to create a 1024x1024px image from a prompt on an A100 GPU. Here’s everything I did to cut SDXL invocation to as fast as 1.92 seconds on an A100.