Kimi-VL-A3B-Thinking
Kimi-VL-A3B-Thinking is an efficient open-source Mixture-of-Experts vision-language model specialized in long-context processing and extended chain-of-thought reasoning. With a 128K context window and only 2.8B activated LLM parameters, it excels in multimodal tasks including image and video comprehension, OCR, mathematical reasoning, and multi-turn agent interactions.
Key Information
- Category: Vision Models
- Source: Huggingface
- Tags: image-text-to-text
- Last updated: January 09, 2026
Structured Metrics
No structured metrics captured yet.
Links
Canonical source: https://huggingface.co/moonshotai/Kimi-VL-A3B-Thinking