Jul 30, 2025
We now support calling models via gRPC! gRPC is type-safe, supports streaming, and is interoperable across languages, making it a good fit for:
Low-latency applications (e.g., video processing)
Microservices
Read the docs to get started.