"Inference Engineering" is now available. Get your copy here
large language

KimiKimi K2.6

Kimi K2.6 builds on Kimi K2.5 with increased agentic capabilities

Model details

Example usage

Kimi K2.6 is the next generation Kimi model that shows meaningful improvements across generalization, decision efficiency, long-horizon task adherence, third-party API understanding, and multi-modal tool use — completing tasks in fewer steps, with better constraint adherence and more robust solutions than its predecessor.

See the docs for more detail:
https://moonshot.feishu.cn/docx/Drp0dBVIUoMPenx3hgUcXYsFnky

Deployments of Kimi K2.6 are OpenAI-compatible.

Input
1# You can use this model with any of the OpenAI clients in any language!
2# Simply change the API Key to get started
3
4from openai import OpenAI
5
6client = OpenAI(
7    api_key="YOUR_API_KEY",
8    base_url="https://inference.baseten.co/v1"
9)
10
11response = client.chat.completions.create(
12    model="moonshotai/Kimi-K2.6",
13    messages=[
14        {
15            "role": "user",
16            "content": "Implement Hello World in Python"
17        }
18    ],
19    stream=True,
20    stream_options={
21        "include_usage": True,
22        "continuous_usage_stats": True
23    },
24    top_p=1,
25    max_tokens=1000,
26    temperature=0.9,
27    presence_penalty=0,
28    frequency_penalty=0
29)
30
31for chunk in response:
32    if chunk.choices and chunk.choices[0].delta.content is not None:
33        print(chunk.choices[0].delta.content, end="", flush=True)
JSON output
1{
2    "id": "143",
3    "choices": [
4        {
5            "finish_reason": "stop",
6            "index": 0,
7            "logprobs": null,
8            "message": {
9                "content": "[Model output here]",
10                "role": "assistant",
11                "audio": null,
12                "function_call": null,
13                "tool_calls": null
14            }
15        }
16    ],
17    "created": 1741224586,
18    "model": "",
19    "object": "chat.completion",
20    "service_tier": null,
21    "system_fingerprint": null,
22    "usage": {
23        "completion_tokens": 145,
24        "prompt_tokens": 38,
25        "total_tokens": 183,
26        "completion_tokens_details": null,
27        "prompt_tokens_details": null
28    }
29}

🔥 Trending models