Model library

Browse our library of open-source models that are ready to deploy behind an API endpoint in seconds.

Large language models

Qwen 3 4B

LLM
V3, TRT-LLM, H100

Qwen 3 32B

LLM
V3, TRT-LLM, H100

Qwen 3 235B

LLM
V3, SGLang, H100

Llama 4 Scout

LLM
V4.0, Instruct, vLLM, H100

Llama 4 Maverick

LLM
V4.0, Instruct, vLLM, B200

DeepSeek-R1

LLM
R1, SGLang, B200

Text to speech models

Orpheus TTS

Text to speech
TRT-LLM, H100 MIG 40GB

MARS6

Text to speech
V6, L4

Coqui XTTS V2

Text to speech
T4

Kokoro

Text to speech
fp16, T4

Transcription models

Whisper (best performance)

Transcription
V3, H100 MIG 40GB

WhisperX

Transcription
L4

Whisper V3

Transcription
V3, H100 MIG 40GB

Whisper V3 Turbo

Transcription
V3, Turbo, H100 MIG 40GB

Image generation models

Fotographer ZenCtrl

Image generation
Custom Server, H100

SDXL Lightning

Image generation
1.0, Lightning, A100

Stable Diffusion 3 Medium

Image generation
3, A100

Stable Diffusion XL

Image generation
XL 1.0, A10G

flux-dev

Image generation
dev, bfloat16, H100 MIG 40GB

flux-schnell

Image generation
schnell, bfloat16, H100 MIG 40GB

Embedding models

Allen AI Tulu 3 8B Reward

Embedding
V3, Reward, BEI, H100 MIG 40GB

BAAI BGE Reranker M3

Embedding
BEI, H100

BAAI BGE Embedding ICL

Embedding
BEI, H100

Mixedbread Embed Large V1

Embedding
V1, Embedding, BEI, L4

Nomic Embed Code

Embedding
BEI, H100 MIG 40GB

DeepSeek models

DeepSeek-R1

LLM
R1, SGLang, B200

DeepSeek-V3

LLM
V3, SGLang, B200

DeepSeek-R1 Llama 70B

LLM
R1, Llama, TRT-LLM, H100

DeepSeek-R1 Qwen 32B

LLM
R1, Qwen, TRT-LLM, H100

DeepSeek-R1 Qwen 7B

LLM
R1, Qwen, TRT-LLM, H100 MIG 40GB

DeepSeek-R1 Zero

LLM
R1, Zero, SGLang, H200

Qwen models

Qwen 3 4B

LLM
V3, TRT-LLM, H100

Qwen 3 32B

LLM
V3, TRT-LLM, H100

Qwen 3 235B

LLM
V3, SGLang, H100

Qwen 2.5 14B Instruct

LLM
2.5, TRT-LLM, H100

Qwen 2.5 32B Coder Instruct

LLM
2.5, Coder, TRT-LLM, H100

Qwen 2.5 7B Math Instruct

LLM
2.5, Math, TRT-LLM, H100 MIG 40GB

Meta models

Llama 4 Scout

LLM
V4.0, Instruct, vLLM, H100

Llama 4 Maverick

LLM
V4.0, Instruct, vLLM, B200

Llama 3.3 70B Instruct

LLM
3.3, TRT-LLM, H100

Llama 3.1 8B Instruct

LLM
3.1, Instruct, TRT-LLM, H100

Llama 3.1 405B Instruct

LLM
3.1, Instruct, H100

Deploy any model in just a few commands

Avoid getting tangled in complex deployment processes. Deploy best-in-class open-source models and take advantage of optimized serving for your own models.

$ truss init --example stable-diffusion-2-1-base ./my-sd-truss
$ cd ./my-sd-truss
$ export BASETEN_API_KEY=MdNmOCXc.YBtEZD0WFOYKso2A6NEQkRqTe
$ truss push

INFO Serializing Stable Diffusion 2.1 truss.
INFO Making contact with Baseten 👋 👽
INFO 🚀 Uploading model to Baseten 🚀
Upload progress: 0% | | 0.00G/2.39G
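Once the push finishes, the model sits behind an API endpoint. As a rough sketch of what calling it from Python might look like, the snippet below builds an authenticated POST request. None of the specifics come from this page, so treat them as assumptions to check against your model's dashboard: the URL pattern, the `Api-Key` authorization scheme, the `abcd1234` model ID, and the `{"prompt": ...}` payload shape are all illustrative. The helper names `build_predict_request` and `call_model` are hypothetical.

```python
import json
import urllib.request


def build_predict_request(model_id: str, payload: dict, api_key: str) -> urllib.request.Request:
    """Build (but do not send) a POST request for a deployed model.

    The URL pattern and the "Api-Key" header scheme are assumptions;
    copy the real endpoint from your model's dashboard.
    """
    url = f"https://model-{model_id}.api.baseten.co/production/predict"
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Api-Key {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def call_model(request: urllib.request.Request) -> dict:
    """Send the prepared request and decode the JSON response body."""
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

To try it, read the key from the same `BASETEN_API_KEY` environment variable exported above (for example with `os.environ["BASETEN_API_KEY"]`), pass it to `build_predict_request` along with your model's ID and input, and hand the result to `call_model`. Building and sending are kept separate so the request can be inspected without touching the network.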