When "Correct" Is Not Safe: Can We Trust Functionally Correct Patches Generated by Code Agents? Paper • 2510.17862 • Published 13 days ago • 4
Glyph: Scaling Context Windows via Visual-Text Compression Paper • 2510.17800 • Published 8 days ago • 61
Efficient Parallel Samplers for Recurrent-Depth Models and Their Connection to Diffusion Language Models Paper • 2510.14961 • Published 12 days ago • 6
OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM Paper • 2510.15870 • Published 11 days ago • 79
Diffusion Transformers with Representation Autoencoders Paper • 2510.11690 • Published 15 days ago • 160
Cache-to-Cache: Direct Semantic Communication Between Large Language Models Paper • 2510.03215 • Published 25 days ago • 93
Less is More: Recursive Reasoning with Tiny Networks Paper • 2510.04871 • Published 22 days ago • 451
BroRL: Scaling Reinforcement Learning via Broadened Exploration Paper • 2510.01180 • Published 27 days ago • 17
Muon Outperforms Adam in Tail-End Associative Memory Learning Paper • 2509.26030 • Published 28 days ago • 19
Paris: A Decentralized Trained Open-Weight Diffusion Model Paper • 2510.03434 • Published 25 days ago • 2
Prosperity before Collapse: How Far Can Off-Policy RL Reach with Stale Data on LLMs? Paper • 2510.01161 • Published 27 days ago • 12
Generalized Parallel Scaling with Interdependent Generations Paper • 2510.01143 • Published 27 days ago • 4
Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution Paper • 2509.25301 • Published 29 days ago • 17
Learning to See Before Seeing: Demystifying LLM Visual Priors from Language Pre-training Paper • 2509.26625 • Published 28 days ago • 43
SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks? Paper • 2509.16941 • Published Sep 21 • 20