HyperVL: An Efficient and Dynamic Multimodal Large Language Model for Edge Devices Paper • 2512.14052 • Published 13 days ago • 39
Towards Scalable Pre-training of Visual Tokenizers for Generation Paper • 2512.13687 • Published 14 days ago • 97
DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models Paper • 2512.02556 • Published 27 days ago • 237
InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision Paper • 2512.01342 • Published 28 days ago • 15
Transformers v5: Simple model definitions powering the AI ecosystem Article • 29 days ago • 258
Uni-MoE-2.0-Omni: Scaling Language-Centric Omnimodal Large Model with Advanced MoE, Training and Data Paper • 2511.12609 • Published Nov 16 • 103