DeepSeek-V3.2-Speciale
A high-compute variant of the open-weight DeepSeek-V3.2 large language model optimized for deep reasoning. It features DeepSeek Sparse Attention (DSA) for efficient long-context handling, scaled RL post-training, and an updated chat template with "thinking with tools" encoding (but the Speciale variant itself does not support tool-calling). Weights are available on Hugging Face (Safetensors; BF16/FP8/F32), MIT-licensed, with guidance to run locally following the V3.2-Exp repo instructions.
Key Information
- Category: Language Models
- Source: Huggingface
- Tags: text-generation
- Last updated: February 24, 2026
Structured Metrics
No structured metrics captured yet.
Links
Canonical source: https://huggingface.co/deepseek-ai/DeepSeek-V3.2-Speciale