Product
Product
Platform
Platform
Solutions
Solutions
Developer
Developer
Resources
Resources
Pricing
Pricing
Log in
Get started
Model library
Browse our library of open source models that are ready to deploy behind an API endpoint in seconds.
Deploy your own model
Filter by
All
LLM
Transcription
Text to speech
Image generation
Embedding
33 H100 models
Transcription
Ultravox v0.6 70B
v0.6
-
H100
Model API
LLM
Llama 4 Scout
V4.0
-
Instruct
-
vLLM
-
H100
LLM
Qwen 3 235B
V3
-
SGLang
-
H100
LLM
Qwen 3 4B
V3
-
TRT-LLM
-
H100
Image generation
ZenCtrl
Custom Server
-
H100
LLM
Qwen 3 32B
V3
-
TRT-LLM
-
H100
Embedding
BGE Reranker M3
BEI
-
H100
Embedding
BGE Embedding ICL
BEI
-
H100
LLM
Llama 3.3 Nemotron 49B Super - NVIDIA NIM
3.3
-
Nemotron
-
H100
LLM
Mistral Small 3.1
3.1
-
vLLM
-
H100
LLM
Gemma 3 27B IT
3
-
Instruct
-
vLLM
-
H100
LLM
DeepSeek-R1 Llama 70B
R1
-
Llama
-
TRT-LLM
-
H100
LLM
Llama 3.3 70B Instruct
3.3
-
TRT-LLM
-
H100
LLM
DeepSeek-R1 Qwen 32B
R1
-
Qwen
-
TRT-LLM
-
H100
LLM
Qwen 2.5 14B Instruct
2.5
-
TRT-LLM
-
H100
LLM
Qwen 2.5 32B Coder Instruct
2.5
-
Coder
-
TRT-LLM
-
H100
LLM
Llama 3.1 8B Instruct
3.1
-
Instruct
-
TRT-LLM
-
H100
LLM
Qwen 2.5 32B QwQ
2.5
-
QwQ
-
TRT-LLM
-
H100
LLM
Llama 3.1 405B Instruct
3.1
-
Instruct
-
H100
LLM
Voxtral Small 24B
2507
-
Small
-
H100
Transcription
Ultravox v0.5 8B
v0.5
-
H100
Embedding
Zerank 1 Small
V1
-
H100
Image generation
ZenCtrl Pro
Custom Server
-
H100
LLM
Llama 3.1 Nemotron Ultra 253B
3.1
-
Nemotron
-
TRT-LLM
-
H100
LLM
Pixtral 12B
Pixtral
-
vLLM
-
H100
LLM
Qwen 2.5 72B Instruct
2.5
-
TRT-LLM
-
H100
LLM
Qwen 2.5 72B Math Instruct
2.5
-
Math
-
TRT-LLM
-
H100
LLM
Qwen 2.5 14B Coder Instruct
2.5
-
Coder
-
TRT-LLM
-
H100
LLM
Qwen 2.5 32B Instruct
2.5
-
TRT-LLM
-
H100
LLM
Llama 3.1 70B Instruct
3.1
-
Instruct
-
TRT-LLM
-
H100
LLM
Llama 3.2 90B Vision Instruct
3.2
-
Vision
-
H100
LLM
Mixtral 8x7B Instruct
v1
-
TRT-LLM
-
H100
LLM
Mixtral 8x22B
H100
🔥 Trending models
Model API
LLM
Kimi K2
V2
Text to speech
Orpheus TTS
TRT-LLM
-
H100 MIG 40GB
Model API
LLM
DeepSeek R1 0528
R1
-
0528
-
B200
LLM
Qwen 3 235B
V3
-
SGLang
-
H100
Explore Baseten today
Start deploying
Talk to an engineer