Learn, Build, Deploy
Baseten supports billions of custom, fine-tuned LLM calls per week from OpenEvidence, serving high-stakes medical information to healthcare providers in every major healthcare facility in the country. If you see a doctor today, chances are that they are leveraging OpenEvidence for trustworthy, up-to-date medical information at their fingertips. Baseten's tireless dedication to reliability and deep support at scale has proven up to the task of supporting this at times literally life-or-death mission.
Working with the Baseten team was a no-brainer. Together, we decreased our model latency by over 50%, reduced our cost per million characters by 44%, and delivered the highest uptime of any inference provider we know of. Baseten has enabled Speechify to provide the highest-quality, lowest-latency, and most cost-efficient AI voice models in the world to consumers, developers, and enterprises.
Blog
All postsIntroducing Baseten Loops

DFlash: 3x faster LLM inference


