DeepSeek-R1-Distill-Llama-8B

A distilled language model from the DeepSeek-R1 series, built on the Llama-3.1-8B base. It is fine-tuned on reasoning data generated by DeepSeek-R1 (supervised fine-tuning only, with no separate reinforcement learning stage applied to the distilled model) and targets text generation and chain-of-thought reasoning, with competitive results on math, code, and reasoning benchmarks.
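
Because the model emits its chain of thought before the final answer, downstream code usually needs to separate the two. The sketch below is a minimal illustration using only the Python standard library. It assumes the reasoning is delimited by `<think>` and `</think>` tags, and that the opening tag may be missing when the chat template has already inserted it. The helper name `split_reasoning` is hypothetical and is not part of any DeepSeek or Hugging Face API.

```python
# Assumed delimiters for the reasoning segment in raw model output.
THINK_OPEN = "<think>"
THINK_CLOSE = "</think>"


def split_reasoning(text: str) -> tuple[str, str]:
    """Split raw output into (reasoning, answer).

    Hypothetical helper: if no closing tag is found, the whole text
    is treated as the answer and the reasoning is empty.
    """
    head, sep, tail = text.partition(THINK_CLOSE)
    if not sep:
        return "", text.strip()
    reasoning = head.strip()
    # The opening tag can be absent if the prompt template already added it.
    if reasoning.startswith(THINK_OPEN):
        reasoning = reasoning[len(THINK_OPEN):].strip()
    return reasoning, tail.strip()


if __name__ == "__main__":
    sample = "<think>\n2 + 2 is 4.\n</think>\nThe answer is 4."
    reasoning, answer = split_reasoning(sample)
    print("reasoning:", reasoning)
    print("answer:", answer)
```

Splitting on the closing tag rather than matching both tags keeps the helper tolerant of outputs where only `</think>` appears in the generated text.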

Key Information

  • Category: Language Models
  • Source: Hugging Face
  • Tags: text-generation
  • Last updated: February 24, 2026

Structured Metrics

No structured metrics captured yet.

Links

Canonical source: https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-8B