VideoScore2: Think before You Score in Generative Video Evaluation Paper • 2509.22799 • Published Sep 26 • 24
Towards Personalized Deep Research: Benchmarks and Evaluations Paper • 2509.25106 • Published Sep 29 • 27
Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution Paper • 2509.25301 • Published Sep 29 • 17
Knapsack RL: Unlocking Exploration of LLMs via Optimizing Budget Allocation Paper • 2509.25849 • Published Sep 30 • 47
OmniVideoBench: Towards Audio-Visual Understanding Evaluation for Omni MLLMs Paper • 2510.10689 • Published 19 days ago • 46
ACADREASON: Exploring the Limits of Reasoning Models with Academic Research Problems Paper • 2510.11652 • Published 18 days ago • 28
Beyond Correctness: Evaluating Subjective Writing Preferences Across Cultures Paper • 2510.14616 • Published 16 days ago • 10
A$^2$FM: An Adaptive Agent Foundation Model for Tool-Aware Hybrid Reasoning Paper • 2510.12838 • Published 18 days ago • 22
Scaling Latent Reasoning via Looped Language Models Paper • 2510.25741 • Published 2 days ago • 65
OAgents: An Empirical Study of Building Effective Agents Paper • 2506.15741 • Published Jun 17 • 35
IWR-Bench: Can LVLMs reconstruct interactive webpage from a user interaction video? Paper • 2509.24709 • Published Sep 29 • 5
EditReward: A Human-Aligned Reward Model for Instruction-Guided Image Editing Paper • 2509.26346 • Published Sep 30 • 18