DeepSeek-R1-Distill-Llama-8B
A distilled language model from the DeepSeek-R1 series built on the Llama-3.1-8B base. It is optimized for text generation and chain-of-thought reasoning tasks through reinforcement learning and selective fine-tuning, delivering competitive performance on math, code, and reasoning benchmarks.
Key Information
- Category: Language Models
- Source: Huggingface
- Tags: text-generation
- Last updated: February 24, 2026
Structured Metrics
No structured metrics captured yet.
Links
Canonical source: https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-8B