large language

KimiKimi K2 Thinking

A 1 trillion parameter reasoning model for agents, coding, and writing

Model details

Example usage

The smartest model in the world is now open source.

Baseten offers Dedicated Deployments for Kimi K2 Thinking powered by the Baseten Inference Stack.

Kimi K2 Thinking rivals GPT 5 and Claude Sonnet 4.5 on agentic, coding, and reasoning benchmarks. Deployments of Kimi are OpenAI-compatible.

Kimi K2 Thinking rivals the top closed-source models on the market.Kimi K2 Thinking rivals the top closed-source models on the market.

Input
1# You can use this model with any of the OpenAI clients in any language!
2# Simply change the API Key to get started
3
4from openai import OpenAI
5
6client = OpenAI(
7    api_key="YOUR_API_KEY",
8    base_url="https://inference.baseten.co/v1"
9)
10
11response = client.chat.completions.create(
12    model="moonshotai/Kimi-K2-Thinking",
13    messages=[
14        {
15            "role": "user",
16            "content": "Implement Hello World in Python"
17        }
18    ],
19    stop=[],
20    stream=True,
21    stream_options={
22        "include_usage": True,
23        "continuous_usage_stats": True
24    },
25    top_p=1,
26    max_tokens=1000,
27    temperature=1,
28    presence_penalty=0,
29    frequency_penalty=0
30)
31
32for chunk in response:
33    if chunk.choices and chunk.choices[0].delta.content is not None:
34        print(chunk.choices[0].delta.content, end="", flush=True)
JSON output
1{
2    "id": "143",
3    "choices": [
4        {
5            "finish_reason": "stop",
6            "index": 0,
7            "logprobs": null,
8            "message": {
9                "content": "[Model output here]",
10                "role": "assistant",
11                "audio": null,
12                "function_call": null,
13                "tool_calls": null
14            }
15        }
16    ],
17    "created": 1741224586,
18    "model": "",
19    "object": "chat.completion",
20    "service_tier": null,
21    "system_fingerprint": null,
22    "usage": {
23        "completion_tokens": 145,
24        "prompt_tokens": 38,
25        "total_tokens": 183,
26        "completion_tokens_details": null,
27        "prompt_tokens_details": null
28    }
29}

large language models

See all
Kimi
LLM

Kimi K2 Thinking

Thinking - K2
Z AI
Model API
LLM

GLM 4.6

4.6

Moonshot AI models

See all
Kimi
LLM

Kimi K2 Thinking

Thinking - K2

🔥 Trending models