Kimi-K2-Thinking

Kimi K2 Thinking is Moonshot AI’s open‑weight “thinking” LLM optimized for step‑by‑step reasoning and native tool use. It’s a Mixture‑of‑Experts model (1T total params, ~32B active) with a 256K context window and native INT4 quantization (QAT) for faster, lower‑memory inference. It supports long‑horizon agentic workflows (200–300 sequential tool calls), achieves strong/SOTA results on benchmarks like HLE and BrowseComp, and provides chat completion and tool‑calling usage examples. License: modified‑MIT.

Key Information

  • Category: Language Models
  • Source: Huggingface
  • Tags: text-generation
  • Last updated: February 24, 2026

Structured Metrics

No structured metrics captured yet.

Links

Canonical source: https://huggingface.co/moonshotai/Kimi-K2-Thinking