DeepSeek-Prover-V1.5-RL

DeepSeek-Prover-V1.5-RL is an open‐source language model for formal theorem proving in Lean 4. It refines the previous DeepSeek-Prover models by incorporating reinforcement learning from proof assistant feedback (RLPAF) and a Monte-Carlo tree search variant (RMaxTS) to generate diverse proof paths, achieving state‐of‐the‐art results on miniF2F and ProofNet benchmarks.

Key Information

  • Category: Language Models
  • Source: Huggingface
  • Last updated: February 24, 2026

Structured Metrics

No structured metrics captured yet.

Links

Canonical source: https://huggingface.co/deepseek-ai/DeepSeek-Prover-V1.5-RL