Article nanoVLM: The simplest repository to train your VLM in pure PyTorch +5 May 21 • 234
D-FINE Collection State-of-the-art real-time object detection model with Apache 2.0 licence • 15 items • Updated May 5 • 56
Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM +2 Mar 12 • 473
DataGemma Release Collection A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated Jul 10 • 86
SigLIP Collection Contrastive (sigmoid) image-text models from https://arxiv.org/abs/2303.15343 • 10 items • Updated Jul 10 • 60
PaliGemma 2 Release Collection Vision-Language Models available in multiple 3B, 10B and 28B variants. • 32 items • Updated Jul 10 • 151
Turkish Vision-Language Datasets Collection Collection of Turkish vision-language datasets. • 30 items • Updated Jul 8 • 11
Article Assisted Generation: a new direction toward low-latency text generation May 11, 2023 • 74
Article Llama can now see and run on your device - welcome Llama 3.2 +5 Sep 25, 2024 • 191