Skip to main content

Documentation Index

Fetch the complete documentation index at: https://platform.kimi.ai/docs/llms.txt

Use this file to discover all available pages before exploring further.

Different model families have different defaults and constraints for Chat Completions API parameters. For the full model list, see the Model List.

Parameter Comparison

When temperature is close to 0, n can only be 1. Otherwise, the API returns invalid_request_error.

Kimi K2.6 β€” thinking Parameter

Kimi K2.6 supports the thinking parameter to control whether deep thinking is enabled. Accepts {"type": "enabled"} or {"type": "disabled"}. Since the OpenAI SDK doesn’t have a native thinking parameter, use extra_body:
completion = client.chat.completions.create(
    model="kimi-k2.6",
    messages=[
        {"role": "user", "content": "Hello"}
    ],
    extra_body={
        "thinking": {"type": "disabled"}
    },
    max_tokens=1024*32,
)