OpenDataArena: A Fair and Open Arena for Benchmarking Post-Training Dataset Value Paper • 2512.14051 • Published 11 days ago • 39
Decouple to Generalize: Context-First Self-Evolving Learning for Data-Scarce Vision-Language Reasoning Paper • 2512.06835 • Published 19 days ago • 3
GGBench: A Geometric Generative Reasoning Benchmark for Unified Multimodal Models Paper • 2511.11134 • Published Nov 14 • 31
Envision: Benchmarking Unified Understanding & Generation for Causal World Process Insights Paper • 2512.01816 • Published 25 days ago • 88