Documentation Index
Fetch the complete documentation index at: https://platform.kimi.ai/docs/llms.txt
Use this file to discover all available pages before exploring further.
You can use our List Models API to get a list of currently available models.
Multi-modal Model
| Model Name | Description |
|---|
kimi-k2.6 | Kimi’s most intelligent model to date, achieving open-source SoTA performance in Agent, code, visual understanding, and a range of general intelligent tasks. It is also Kimi’s most versatile model to date, featuring a native multimodal architecture that supports both visual and text input, thinking and non-thinking modes, and dialogue and Agent tasks. Context 256k |
kimi-k2.5 | Kimi’s most intelligent model to date, achieving open-source SoTA performance in Agent, code, visual understanding, and a range of general intelligent tasks. It is also Kimi’s most versatile model to date, featuring a native multimodal architecture that supports both visual and text input, thinking and non-thinking modes, and dialogue and Agent tasks. Context 256k |
Kimi K2 Model
kimi-k2 series models will be officially discontinued on May 25, 2026 and will no longer be maintained or supported. Please use the latest Kimi model kimi-k2.6 for continued support and enhanced reasoning capabilities.
| Model Name | Description |
|---|
kimi-k2-0905-preview | Context length 256k, enhanced Agentic Coding capabilities, front-end code aesthetics and practicality, and context understanding capabilities based on the 0711 version |
kimi-k2-0711-preview | Context length 128k, MoE architecture base model with 1T total parameters, 32B activated parameters. Features powerful code and Agent capabilities. View technical blog |
kimi-k2-turbo-preview | High-speed version of K2, benchmarking against the latest version (0905). Output speed increased to 60-100 tokens per second, context length 256k |
kimi-k2-thinking | K2 Long-term thinking model, supports 256k context, supports multi-step tool usage and reasoning, excels at solving more complex problems |
kimi-k2-thinking-turbo | K2 Long-term thinking model high-speed version, supports 256k context, excels at deep reasoning, output speed increased to 60-100 tokens per second |
Generation Model Moonshot V1
| Model Name | Description |
|---|
moonshot-v1-8k | Suitable for generating short texts, context length 8k |
moonshot-v1-32k | Suitable for generating long texts, context length 32k |
moonshot-v1-128k | Suitable for generating very long texts, context length 128k |
moonshot-v1-8k-vision-preview | Vision model, understands image content and outputs text, context length 8k |
moonshot-v1-32k-vision-preview | Vision model, understands image content and outputs text, context length 32k |
moonshot-v1-128k-vision-preview | Vision model, understands image content and outputs text, context length 128k |
Note: The only difference between these Moonshot V1 models is their maximum context length (including input and output), there is no difference in effect.
Deprecated Models
kimi-latest was officially discontinued on January 28, 2026 and is no longer maintained or supported. Please use the latest Kimi model kimi-k2.6 for continued support and enhanced reasoning capabilities.
kimi-thinking-preview was officially discontinued on November 11, 2025 and is no longer maintained or supported. We recommend upgrading to the latest model kimi-k2.6 for continued support and enhanced reasoning capabilities.
For further assistance, please contact sales.