DeepSeek-V3.2-Speciale

A high-compute variant of the open-weight DeepSeek-V3.2 large language model optimized for deep reasoning. It features DeepSeek Sparse Attention (DSA) for efficient long-context handling, scaled RL post-training, and an updated chat template with "thinking with tools" encoding (but the Speciale variant itself does not support tool-calling). Weights are available on Hugging Face (Safetensors; BF16/FP8/F32), MIT-licensed, with guidance to run locally following the V3.2-Exp repo instructions.

Key Information

  • Category: Language Models
  • Source: Huggingface
  • Tags: text-generation
  • Last updated: February 24, 2026

Structured Metrics

No structured metrics captured yet.

Links

Canonical source: https://huggingface.co/deepseek-ai/DeepSeek-V3.2-Speciale