Kimi-VL-A3B-Thinking

Kimi-VL-A3B-Thinking is an efficient open-source Mixture-of-Experts vision-language model specialized in long-context processing and extended chain-of-thought reasoning. With a 128K context window and only 2.8B activated LLM parameters, it excels in multimodal tasks including image and video comprehension, OCR, mathematical reasoning, and multi-turn agent interactions.

Key Information

  • Category: Vision Models
  • Source: Huggingface
  • Tags: image-text-to-text
  • Last updated: January 09, 2026

Structured Metrics

No structured metrics captured yet.

Links

Canonical source: https://huggingface.co/moonshotai/Kimi-VL-A3B-Thinking