UniGenBench++: A Unified Semantic Evaluation Benchmark for Text-to-Image Generation Paper • 2510.18701 • Published 8 days ago • 66
A Survey of Context Engineering for Large Language Models Paper • 2507.13334 • Published Jul 17 • 257
Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning Paper • 2507.00432 • Published Jul 1 • 79
Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better Paper • 2506.09040 • Published Jun 10 • 34
GeometryZero: Improving Geometry Solving for LLM with Group Contrastive Policy Optimization Paper • 2506.07160 • Published Jun 8 • 3
Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning Paper • 2505.03318 • Published May 6 • 93
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models Paper • 2504.10479 • Published Apr 14 • 297