MoonshotAI logo

Kimi K2 Thinking: Pricing, Context Window & Benchmarks

by MoonshotAI

Kimi K2 Thinking is Moonshot AI’s most advanced open reasoning model to date, extending the K2 series into agentic, long-horizon reasoning. Built on the trillion-parameter Mixture-of-Experts (MoE) architecture introduced in Kimi K2, it activates 32 billion parameters per forward pass and supports 256 k-token context windows. The model is optimized for persistent step-by-step thought, dynamic tool invocation, and complex reasoning workflows that span hundreds of turns. It interleaves step-by-step reasoning with tool use, enabling autonomous research, coding, and writing that can persist for hundreds of sequential actions without drift. It sets new open-source benchmarks on HLE, BrowseComp, SWE-Multilingual, and LiveCodeBench, while maintaining stable multi-agent behavior through 200–300 tool calls. Built on a large-scale MoE architecture with MuonClip optimization, it combines strong reasoning depth with high inference efficiency for demanding agentic and analytical tasks.

Chat with Kimi K2 Thinking
Input Price
$0.60/1M tokens
Output Price
$2.50/1M tokens
Intelligence
40.7
Coding
34.8

What you can do with Kimi K2 Thinking

Everyday Q&A and clear explanations

Writing help (emails, posts, summaries)

Idea generation and brainstorming

Learning support with step-by-step guidance

Composite Indices

Intelligence, Coding, Math

Standard Benchmarks

Academic and industry benchmarks

Benchmark Highlights

5 tests
GPQA
83.8%
MMLU Pro
84.8%
LiveCodeBench
85.3%
Math 500
N/A
AIME 2025
94.7%
HLE
22.3%
Metric Value
Provider MoonshotAI
Context Window 131,072 tokens
Input Price $0.60/1M tokens
Output Price $2.50/1M tokens
Release Date Nov 6, 2025
Modalities text
Capabilities N/A

Compare Kimi K2 Thinking to other models

See how it stacks up on price, quality, and overall performance.

Frequently asked questions

What is Kimi K2 Thinking good for?

Use Kimi K2 Thinking for everyday tasks like writing, summarizing, brainstorming, and getting clear explanations.

How much does Kimi K2 Thinking cost?

Pricing is based on usage. Current rates are $0.60/1M tokens for input and $2.50/1M tokens for output.

Can I try Kimi K2 Thinking for free?

Yes. You can start a chat instantly and test the model before deciding on a plan.

Does Kimi K2 Thinking support images or audio?

Kimi K2 Thinking focuses on text-based tasks.

Benchmarks and pricing are sourced from Artificial Analysis where available. OpenRouter specs are used as a fallback.