FutureSim: Replaying World Events to Evaluate Adaptive Agents Paper • 2605.15188 • Published May 14 • 7
FutureSim Agent Trajectories 🚀 Trajectories of frontier agents on the FutureSim benchmark. Agents • Running • 3
Training AI Co-Scientists Using Rubric Rewards Paper • 2512.23707 • Published Dec 29, 2025 • 21
Scaling Open-Ended Reasoning to Predict the Future Paper • 2512.25070 • Published Dec 31, 2025 • 20