"Inference Engineering" is now available. Get your copy here
large language

Qwen LogoQwen 2.5 72B Math Instruct

The largest model in the Qwen family of LLMs, tuned for math

Model details

View repository

Example usage

Qwen 2.5 72B Math is an OpenAI-compatible model and can be called using the OpenAI SDK in any language.

Input
1from openai import OpenAI
2import os
3
4model_url = "" # Copy in from API pane in Baseten model dashboard
5
6client = OpenAI(
7    api_key=os.environ['BASETEN_API_KEY'],
8    base_url=model_url
9)
10
11# Chat completion
12response_chat = client.chat.completions.create(
13    model="",
14    messages=[
15        {"role": "user", "content": "Tell me a fun fact about cats."}
16    ],
17    temperature=0.3,
18    max_tokens=100,
19)
20print(response_chat)
JSON output
1{
2    "id": "143",
3    "choices": [
4        {
5            "finish_reason": "stop",
6            "index": 0,
7            "logprobs": null,
8            "message": {
9                "content": "[Model output here]",
10                "role": "assistant",
11                "audio": null,
12                "function_call": null,
13                "tool_calls": null
14            }
15        }
16    ],
17    "created": 1741224586,
18    "model": "",
19    "object": "chat.completion",
20    "service_tier": null,
21    "system_fingerprint": null,
22    "usage": {
23        "completion_tokens": 145,
24        "prompt_tokens": 38,
25        "total_tokens": 183,
26        "completion_tokens_details": null,
27        "prompt_tokens_details": null
28    }
29}

large language models

See all
DeepSeek Logo
LLM

DeepSeek V3.2

V3.2 - B200
MiniMax
Model API
LLM

MiniMax M2.5

M2.5
Z AI
Model API
LLM

GLM 5

5

Qwen models

See all
Qwen Logo
Embedding

Qwen3 8B Reranker

BEI - H100 MIG 40GB
Qwen Logo
Model API
LLM

Qwen3 Coder 480B

3 - Coder

🔥 Trending models