Announcing our Series F. Learn more

Model library

Browse our library of open source models that are ready to deploy behind an API endpoint in seconds.

88 large language models

Z AI Logo
Model API
LLM

GLM 5.2

5.2
poolside
LLM

Laguna M.1

H100
Kimi
Model API
LLM

Kimi K2.7 Code

2.7 - Code
NVIDIA logo
Model API
LLM

NVIDIA Nemotron 3 Super

Super
NVIDIA logo
Model API
LLM

NVIDIA Nemotron 3 Ultra

Ultra
Kimi
Model API
LLM

Kimi K2.6

2.6
Z AI Logo
Model API
LLM

GLM 5.1

5.1
Qwen Logo
LLM

Qwen3.6 27B

V1 - Latency - H100
Meta logo
LLM

Llama 3.3 70B Instruct

3.3 - TRT-LLM - H100
NVIDIA logo
LLM

Nemotron 3 Nano Omni

V1 - Latency - H100
DeepSeek Logo
Model API
LLM

DeepSeek V4

V4 - B200
Z AI Logo
LLM

GLM 4.6

4.6
DeepSeek Logo
Model API
LLM

DeepSeek V3.1

V3.1 - B200
Kimi
LLM

Kimi K2 Thinking

Thinking - K2
Kimi
Model API
LLM

Kimi K2.5

2.5
Z AI Logo
Model API
LLM

GLM 4.7

4.7
Z AI Logo
Model API
LLM

GLM 5

5
MiniMax
Model API
LLM

MiniMax M2.5

M2.5
gemma
LLM

Gemma 4 E2B IT

4 - Latency - H100
gemma
LLM

Gemma 4 E4B IT

4 - Latency - H100
gemma
LLM

Gemma 4 26B A4B IT

4 - Latency - H100
gemma
LLM

Gemma 4 31B IT

4 - Latency - H100
Qwen Logo
LLM

Qwen3.5 9B

V1 - Latency - vLLM - H100
Qwen Logo
LLM

Qwen3.5 35B-A3B

V1 - Latency - vLLM - H100
Qwen Logo
LLM

Qwen3.5 122B-A10B

V1 - Latency - vLLM - H100
Qwen Logo
LLM

Qwen3.5 4B

V1 - Latency - vLLM - H100
OpenAI logo
Model API
LLM

GPT OSS 120B

MoE
DeepSeek Logo
LLM

DeepSeek V3.2

V3.2 - B200
Z AI Logo
LLM

GLM-4.6V

4.6 - Vision
Qwen Logo
Model API
LLM

Qwen3 Coder 480B

3 - Coder
Qwen Logo
LLM

Qwen 3 32B

V3 - TRT-LLM - H100
Qwen Logo
LLM

Qwen3 VL 235B

3 - Vision Language
Z AI Logo
LLM

GLM-4.5V

4.5 - Vision
Qwen Logo
LLM

Qwen3 Coder 30B

3 - Coder
Z AI Logo
LLM

GLM-4.5 Air

4.5 - Air
Fixie Logo
Transcription

Ultravox v0.6 70B

v0.6 - H100
Qwen Logo
LLM

Qwen 3 235B

V3 - SGLang - H100
Qwen Logo
LLM

Qwen 3 4B

V3 - TRT-LLM - H100
Mistral AI logo
LLM

Mistral Small 3.1

3.1 - vLLM - H100
google logo
LLM

Gemma 3 27B IT

3 - Instruct - vLLM - H100
DeepSeek Logo
LLM

DeepSeek-R1 Llama 70B

R1 - Llama - TRT-LLM - H100
DeepSeek Logo
LLM

DeepSeek-R1 Qwen 32B

R1 - Qwen - TRT-LLM - H100
Meta logo
LLM

Llama 3.1 8B Instruct

3.1 - Instruct - TRT-LLM - H100
DeepSeek Logo
LLM

DeepSeek-R1 Qwen 7B

R1 - Qwen - TRT-LLM - H100 MIG 40GB
NVIDIA logo
LLM

Llama 3.1 Nemotron 70B

3.1 - Nemotron - A100
Meta logo
LLM

Llama 3.1 405B Instruct

3.1 - Instruct - H100
Meta logo
LLM

Llama 3.2 11B Vision Instruct

3.2 - Vision - A100
H Company logo
LLM

Holo 3.1 35B-A3B

V1 - Throughput - H100
Qwen Logo
LLM

Qwen3.6 35B-A3B

V1 - Latency - H100
Meta logo
LLM

Llama 4 Maverick

V4.0 - Instruct - vLLM - B200
DeepSeek Logo
Model API
LLM

DeepSeek V3 0324

V3 - 0324 - B200
Z AI Logo
LLM

GLM 4.7 Flash

V1 - Latency - H100
Meta logo
LLM

Llama 4 Scout

V4.0 - Instruct - vLLM - H100
DeepSeek Logo
LLM

DeepSeek R1 0528

R1 - 0528 - B200
Qwen Logo
LLM

Qwen3 Omni Thinker

Omni - Thinker
ByteDance logo
LLM

Seed OSS 36B Instruct

Seed OSS 36B Instruct - Instruct - vLLM - H100
Qwen Logo
LLM

Qwen3 Next 80B A3B Thinking

Qwen3 Next 80B A3B Instruct - Instruct - SGLang - H100
Qwen Logo
LLM

Qwen3 Next 80B A3B Instruct

Qwen3 Next 80B A3B Instruct - Instruct - SGLang - H100
NVIDIA logo
LLM

Llama 3.1 Nemotron Ultra 253B

3.1 - Nemotron - TRT-LLM - H100
Mistral AI logo
LLM

Pixtral 12B

Pixtral - vLLM - H100
Mistral AI logo
LLM

Mistral 7B Instruct

v3 - TRT-LLM - H100 MIG 40GB
Meta logo
LLM

Llama 3.1 70B Instruct

3.1 - Instruct - TRT-LLM - H100
DeepSeek Logo
LLM

DeepSeek-R1 Zero

R1 - Zero - SGLang - H200
Meta logo
LLM

Llama 3.2 90B Vision Instruct

3.2 - Vision - H100
Microsoft Logo
LLM

Phi 3.5 Mini Instruct

3.5 - 128k - vLLM - A10G
Microsoft Logo
LLM

Phi 3 Mini 128K Instruct

3 - 128k - T4
DeepSeek Logo
LLM

DeepSeek Prover V2 671B

V2 - Prover - SGLang
Mistral AI logo
LLM

Mixtral 8x7B Instruct

v1 - TRT-LLM - H100

🔥 Trending models