"Inference Engineering" is now available. Get your copy here
changelog / post

Introducing the Baseten Delivery Network (BDN)

Go back

We just launched the Baseten Delivery Network (BDN), designed to make cold starts 2-3x faster for large models.

BDN solves three root causes of slow cold starts: slow weight pulls from upstream storage, replica stampedes under load, and upstream availability dependencies. On first deployment, BDN mirrors your weights to secure storage. From there, a multi-tier cache (node → cluster → mirrored origin) serves weights with consistent hashing and single-flight semantics: each file fetched once per cluster, not once per pod. Fine-tunes sharing weights with a base model only pull the delta.

Check out the launch blog to learn more, or see the docs to get started.