BytedanceDouyinContent/SAILViT-Huge-600M-448px Image Feature Extraction • 0.7B • Updated Jul 3 • 18 • 3
BytedanceDouyinContent/SAILViT-Large-300M-448px Image Feature Extraction • 0.3B • Updated Jul 3 • 11 • 2
SAIL-Embedding Technical Report: Omni-modal Embedding Foundation Model Paper • 2510.12709 • Published 15 days ago • 10
Scalable Vision Language Model Training via High Quality Data Curation Paper • 2501.05952 • Published Jan 10 • 5
SAILViT: Towards Robust and Generalizable Visual Backbones for MLLMs via Gradual Feature Refinement Paper • 2507.01643 • Published Jul 2 • 1
MEML-GRPO: Heterogeneous Multi-Expert Mutual Learning for RLVR Advancement Paper • 2508.09670 • Published Aug 13
Scalable Vision Language Model Training via High Quality Data Curation Paper • 2501.05952 • Published Jan 10 • 5
SAIL-VL Collection Scalable Vision Language Model Training via High Quality Data Curation • 6 items • Updated Sep 18 • 1