MR-BEN: A Comprehensive Meta-Reasoning Benchmark for Large Language Models Paper • 2406.13975 • Published Jun 20, 2024
Effi-Code: Unleashing Code Efficiency in Language Models Paper • 2410.10209 • Published Oct 14, 2024
SwingArena: Competitive Programming Arena for Long-context GitHub Issue Solving Paper • 2505.23932 • Published May 29, 2025
SWE-Lego: Pushing the Limits of Supervised Fine-tuning for Software Issue Resolving Paper • 2601.01426 • Published 5 days ago
EffiLearner: Enhancing Efficiency of Generated Code via Self-Optimization Paper • 2405.15189 • Published May 24, 2024
MHPP: Exploring the Capabilities and Limitations of Language Models Beyond Basic Code Generation Paper • 2405.11430 • Published May 19, 2024