Inference will be the largest market ever created.

AI’s future won’t be a few massive models built by a handful of labs. It’ll be millions of specialized models embedded into every product, workflow, and experience by the people closest to the customer. The foundation of that future is inference.

Inference determines the performance, reliability, latency, and economics of every AI product. For AI to scale globally, inference must be as reliable, fast, cost-effective, and high-quality as possible. That’s why Baseten exists.

We’re an interdisciplinary team of researchers, engineers, and operators building the Inference Cloud our AI future demands. We’re running at a hard systems problem that requires first-principles thinking across the entire stack.

We’re customer-obsessed, and it shows. Companies like Abridge, Cursor, Lovable, Notion, and OpenEvidence depend on Baseten to power mission-critical AI workloads in production.

The bar is high. We work hard, move fast, and care deeply about quality.

Sound like a good fit? View open roles below.