- Less is More: Recursive Reasoning with Tiny Networks
  Paper • 2510.04871 • Published • 447
- Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play
  Paper • 2509.25541 • Published • 136
- Agent Learning via Early Experience
  Paper • 2510.08558 • Published • 242
- DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search
  Paper • 2509.25454 • Published • 133
shen
sean29
AI & ML interests
None yet
Recent Activity
updated a collection 12 days ago: todo
updated a collection 13 days ago: todo
updated a collection 13 days ago: todo
Organizations
None yet