-
Less is More: Recursive Reasoning with Tiny Networks
Paper • 2510.04871 • Published • 455 -
Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play
Paper • 2509.25541 • Published • 137 -
Agent Learning via Early Experience
Paper • 2510.08558 • Published • 253 -
DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search
Paper • 2509.25454 • Published • 136
shen
sean29
AI & ML interests
None yet
Recent Activity
updated
a collection
15 days ago
todo
updated
a collection
16 days ago
todo
updated
a collection
16 days ago
todo
Organizations
None yet