Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning Paper • 2604.12374 • Published 7 days ago • 35
Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability Paper • 2604.06628 • Published 13 days ago • 317
Seedance 2.0: Advancing Video Generation for World Complexity Paper • 2604.14148 • Published 6 days ago • 147
Toward Autonomous Long-Horizon Engineering for ML Research Paper • 2604.13018 • Published 7 days ago • 34
Efficient RL Training for LLMs with Experience Replay Paper • 2604.08706 • Published 12 days ago • 17
General365: Benchmarking General Reasoning in Large Language Models Across Diverse and Challenging Tasks Paper • 2604.11778 • Published 8 days ago • 8
WildDet3D: Scaling Promptable 3D Detection in the Wild Paper • 2604.08626 • Published 12 days ago • 239
AgentSwing: Adaptive Parallel Context Management Routing for Long-Horizon Web Agents Paper • 2603.27490 • Published 23 days ago • 17
Inference Optimized Checkpoints (with Model Optimizer) Collection A collection of generative models quantized and optimized for inference with Model Optimizer. • 61 items • Updated about 3 hours ago • 142
Act Wisely: Cultivating Meta-Cognitive Tool Use in Agentic Multimodal Models Paper • 2604.08545 • Published 12 days ago • 41
SkillClaw: Let Skills Evolve Collectively with Agentic Evolver Paper • 2604.08377 • Published 12 days ago • 280
OpenVLThinkerV2: A Generalist Multimodal Reasoning Model for Multi-domain Visual Tasks Paper • 2604.08539 • Published 12 days ago • 48
Think in Strokes, Not Pixels: Process-Driven Image Generation via Interleaved Reasoning Paper • 2604.04746 • Published 13 days ago • 70
AMoE: Agglomerative Mixture-of-Experts Vision Foundation Model Paper • 2512.20157 • Published Dec 23, 2025 • 5
Paper Circle: An Open-source Multi-agent Research Discovery and Analysis Framework Paper • 2604.06170 • Published 14 days ago • 31