Baseten Blog | Page 9


How we achieved SOC 2 and HIPAA compliance as an early-stage company

Baseten is a SOC 2 Type II certified and HIPAA-compliant platform for fine-tuning, deploying, and serving ML models, including LLMs.


Baseten achieves SOC 2 Type II certification

Baseten, an MLOps platform for model deployment & fine-tuning, now holds SOC 2 Type II certification, covering data security, privacy, and confidentiality.


Technical deep dive: Truss live reload

Truss' live reload feature revolutionizes iterative development, turning the lengthy 3-30 minute model deployment process into an almost instant task.


New in January 2023

Deploy multiple model versions, model resource management, a cleaner Truss DX, and more.

GPU guides

Choosing the right horizontal scaling setup for high-traffic models

Horizontal scaling via replicas with load balancing is an important technique for handling high traffic to an ML model.

GPU guides

How to choose the right instance size for your ML models

This post simplifies instance sizing with heuristics to choose an optimal size for your model, balancing performance and compute cost.


New in December 2022

ML advanced so quickly in 2022 that the year felt like a decade. Looking ahead to 2023, we expect foundation models to further empower scientists and developers building ML apps.

Hacks & projects

Serving four million Riffusion requests in two days

Riffusion is a fine-tuned version of Stable Diffusion. Baseten served over four million Riffusion requests in a couple of days, handling top-of-Hacker-News traffic.


Accelerating model deployment: 100X faster dev loops with development deployments

Baseten's development deployments speed up ML model dev loops, replacing slow workflows with a live reload system that applies and tests updates in seconds.