Reliable Fine-Grained Evaluation of Natural Language Math Proofs Paper • 2510.13888 • Published 18 days ago • 1