Article cua-bench: A Framework for Benchmarking, Training Data, and RL Environments for Computer-Use Agents 9 days ago • 9
Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies Paper • 2512.19673 • Published 3 days ago • 54
DeContext as Defense: Safe Image Editing in Diffusion Transformers Paper • 2512.16625 • Published 7 days ago • 24
Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition Paper • 2512.15603 • Published 8 days ago • 55
Depth Any Panoramas: A Foundation Model for Panoramic Depth Estimation Paper • 2512.16913 • Published 7 days ago • 31
Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular +4 8 days ago • 74
LLaDA2.0: Scaling Up Diffusion Language Models to 100B Paper • 2512.15745 • Published 15 days ago • 75
DEER: Draft with Diffusion, Verify with Autoregressive Models Paper • 2512.15176 • Published 8 days ago • 41
OpenDataArena: A Fair and Open Arena for Benchmarking Post-Training Dataset Value Paper • 2512.14051 • Published 9 days ago • 38