Article Exploring Environments Hub: Your Language Model needs better (open) environments to learn By anakin87 • Sep 4 • 27
Article ModernVBERT: Towards Smaller Visual Document Retrievers By paultltc and 4 others • 28 days ago • 42
Article There is no such thing as a tokenizer-free lunch By catherinearnett • Sep 25 • 84
LFM2-VL Collection LFM2-VL is our first series of vision-language models, designed for on-device deployment. • 9 items • Updated about 20 hours ago • 47
Article No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL Jun 3 • 93
Article NVIDIA Cosmos Now Available On Hugging Face For Physical AI Reasoning By PranjaliJoshi and 1 other • May 19 • 26
Article Page-to-Video: Generate videos from webpages By burtenshaw • May 6 • 27
Article π0 and π0-FAST: Vision-Language-Action Models for General Robot Control Feb 4 • 180
Article Mastering Long Contexts in LLMs with KVPress By nvidia and 1 other • Jan 23 • 70