WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic Data and Scalable Reinforcement Learning Paper • 2509.13305 • Published Sep 16 • 91
MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use Paper • 2509.24002 • Published Sep 28 • 173
A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers Paper • 2508.21148 • Published Aug 28 • 140
Intern-S1: A Scientific Multimodal Foundation Model Paper • 2508.15763 • Published Aug 21 • 256
LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries Paper • 2508.15760 • Published Aug 21 • 46
MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval Paper • 2412.14475 • Published Dec 19, 2024 • 55
Comparing Captioning Models: Generate captions for images using multiple models • Running • Featured • 459