DeepSeek-Prover-V1.5-RL
DeepSeek-Prover-V1.5-RL is an open-source language model for formal theorem proving in Lean 4. It refines the previous DeepSeek-Prover models by adding reinforcement learning from proof assistant feedback (RLPAF) and RMaxTS, a Monte-Carlo tree search variant that generates diverse proof paths. At the time of its release it reported state-of-the-art results on the miniF2F and ProofNet benchmarks.
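For readers unfamiliar with Lean 4, the snippet below gives a rough idea of the kind of task involved. It is only an illustration: the theorem is not taken from miniF2F or ProofNet, the name `add_comm_example` is made up, and it assumes the prover receives a theorem statement and is expected to supply the tactic proof that follows `by`.

```lean
-- Hypothetical example, not drawn from any benchmark.
-- Statement: addition on natural numbers is commutative.
theorem add_comm_example (a b : Nat) : a + b = b + a := by
  -- A prover would be expected to produce the tactic steps from here on.
  exact Nat.add_comm a b
```

Lean checks the proof mechanically, so a generated proof is either accepted or rejected. That verdict is the kind of proof assistant feedback that RLPAF refers to.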
Key Information
- Category: Language Models
- Source: Hugging Face
- Last updated: February 24, 2026
Structured Metrics
No structured metrics captured yet.
Links
Canonical source: https://huggingface.co/deepseek-ai/DeepSeek-Prover-V1.5-RL