DeepSeek-V3.1-Base

DeepSeek-V3.1-Base is a highly advanced, long-context text generation model that supports both thinking and non-thinking modes. It introduces hybrid thinking mode, improved tool calling, and enhanced efficiency compared to preceding versions. With 671B parameters (37B activated) and a 128K context window, it is optimized using the UE8M0 FP8 scale format and represents a significant upgrade in tool usage and agent tasks. It is designed for complex conversational and code generation tasks.

Key Information

  • Category: Language Models
  • Source: Huggingface
  • Tags: text-generation
  • Last updated: February 24, 2026

Structured Metrics

No structured metrics captured yet.

Links

Canonical source: https://huggingface.co/deepseek-ai/DeepSeek-V3.1-Base