AI-MO/Kimina-Prover-RL-1.7B is a 1.7-billion-parameter theorem-proving model developed by the Project Numina and Kimi teams. Fine-tuned from AI-MO/Kimina-Prover-Distill-1.7B with reinforcement learning, it specializes in competition-style problem solving in Lean 4 and achieves 76.63% Pass@32 on the MiniF2F-test benchmark for formal mathematics.
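The description does not say how the Pass@32 figure was computed. Under the usual convention for theorem provers, a problem counts as solved if at least one of k sampled proofs passes the Lean checker, and Pass@k is the fraction of problems solved. A minimal sketch of that calculation, assuming this convention; the function name `pass_at_k` and the toy data are hypothetical and not part of the model's tooling:

```python
from typing import Sequence


def pass_at_k(attempts: Sequence[Sequence[bool]], k: int) -> float:
    """Fraction of problems with at least one verified proof in the first k attempts.

    attempts[i][j] is True when the j-th sampled proof for problem i
    was accepted by the proof checker (assumed convention).
    """
    if not attempts:
        raise ValueError("need at least one problem")
    solved = sum(1 for results in attempts if any(results[:k]))
    return solved / len(attempts)


# Hypothetical verification outcomes: three problems, four attempts each.
toy_results = [
    [False, False, True, False],   # solved on the third attempt
    [False, False, False, False],  # never solved
    [True, False, False, False],   # solved on the first attempt
]
score = pass_at_k(toy_results, k=4)
print(f"Pass@4 = {score:.2%}")
```

With k set to 32 and one row of checker outcomes per benchmark problem, the same function would yield a Pass@32 score of the kind quoted above.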