SlimSearcher: Training Efficiency-Aware Web Agents via Adaptive Reward Gating • Paper • 2606.07074 • Published 17 days ago • 12
GroupRank: A Groupwise Reranking Paradigm Driven by Reinforcement Learning • Paper • 2511.11653 • Published Nov 10, 2025 • 59