Model library

Browse our library of open-source models, ready to deploy behind an API endpoint in seconds.

🔥 Trending models

large language models

Meta logo
Model API
LLM

Llama 4 Maverick

V4.0 - Instruct - vLLM - B200
DeepSeek Logo
Model API
LLM

DeepSeek-V3

V3 - SGLang - B200
DeepSeek Logo
Model API
LLM

DeepSeek-R1

R1 - SGLang - B200
Meta logo
Model API
LLM

Llama 4 Scout

V4.0 - Instruct - vLLM - H100
Qwen Logo
LLM

Qwen 3 235B

V3 - SGLang - H100
Qwen Logo
LLM

Qwen 3 4B

V3 - TRT-LLM - H100

text to speech models

Canopy Labs Logo
Text to speech

Orpheus TTS

TRT-LLM - H100 MIG 40GB
Text to speech

MARS6

V6 - L4
Coqui
Text to speech

XTTS V2

T4
Text to speech

Kokoro

fp16 - T4

transcription models

OpenAI logo
Transcription

Whisper (best performance)

V3 - H100 MIG 40GB
OpenAI logo
Transcription

WhisperX

L4
OpenAI logo
Transcription

Whisper V3

V3 - H100 MIG 40GB
OpenAI logo
Transcription

Whisper V3 Turbo

V3 - Turbo - H100 MIG 40GB

image generation models

Fotographer AI
Image generation

ZenCtrl

Custom Server - H100
ByteDance logo
Image generation

SDXL Lightning

1.0 - Lightning - A100
Stability AI logo
Image generation

Stable Diffusion 3 Medium

3 - A100
Stability AI logo
Image generation

Stable Diffusion XL

XL 1.0 - A10G
black forest labs logo
Image generation

flux-schnell

schnell - bfloat16 - H100 MIG 40GB
Stability AI logo
Image generation

Stable Video Diffusion

Video 1.0 - A100

embedding models

Allen AI
Embedding

Tulu 3 8B Reward

V3 - Reward - BEI - H100 MIG 40GB
BAAI
Embedding

BGE Reranker M3

BEI - H100
BAAI
Embedding

BGE Embedding ICL

BEI - H100
Mixedbread
Embedding

Mixedbread Embed Large V1

V1 - Embedding - BEI - L4
Nomic AI logo
Embedding

Nomic Embed Code

BEI - H100 MIG 40GB

DeepSeek models

DeepSeek Logo
Model API
LLM

DeepSeek-V3

V3 - SGLang - B200
DeepSeek Logo
Model API
LLM

DeepSeek-R1

R1 - SGLang - B200
DeepSeek Logo
LLM

DeepSeek-R1 Llama 70B

R1 - Llama - TRT-LLM - H100
DeepSeek Logo
LLM

DeepSeek-R1 Qwen 32B

R1 - Qwen - TRT-LLM - H100
DeepSeek Logo
LLM

DeepSeek-R1 Qwen 7B

R1 - Qwen - TRT-LLM - H100 MIG 40GB
DeepSeek Logo
LLM

DeepSeek-R1 Zero

R1 - Zero - SGLang - H200

Qwen models

Qwen Logo
LLM

Qwen 3 235B

V3 - SGLang - H100
Qwen Logo
LLM

Qwen 3 4B

V3 - TRT-LLM - H100
Qwen Logo
LLM

Qwen 3 32B

V3 - TRT-LLM - H100
Qwen Logo
LLM

Qwen 2.5 14B Instruct

2.5 - TRT-LLM - H100
Qwen Logo
LLM

Qwen 2.5 32B Coder Instruct

2.5 - Coder - TRT-LLM - H100
Qwen Logo
LLM

Qwen 2.5 7B Math Instruct

2.5 - Math - TRT-LLM - H100 MIG 40GB

Meta models

Meta logo
Model API
LLM

Llama 4 Maverick

V4.0 - Instruct - vLLM - B200
Meta logo
Model API
LLM

Llama 4 Scout

V4.0 - Instruct - vLLM - H100
Meta logo
LLM

Llama 3.3 70B Instruct

3.3 - TRT-LLM - H100
Meta logo
LLM

Llama 3.1 8B Instruct

3.1 - Instruct - TRT-LLM - H100
Meta logo
LLM

Llama 3.1 405B Instruct

3.1 - Instruct - H100
Meta logo
LLM

Llama 3.2 11B Vision Instruct

3.2 - Vision - A100