rwightman's picture
rwightman HF Staff
Add model
5932820 verified
metadata
tags:
  - timm
  - transformers
  - image-feature-extraction
  - mobileclip
  - mobileclip2
library_name: timm
license: apple-amlr
datasets:
  - dfndr-2b

Model card for fastvit_mci3.apple_mclip2_dfndr2b

A MobileCLIP v2 (image encoder only) for timm. Equivalent to image tower from https://huggingface.co/timm/MobileCLIP2-S3-OpenCLIP.

Model Details

Citation

@article{faghri2025mobileclip2,
          title={MobileCLIP2: Improving Multi-Modal Reinforced Training},
          author={Faghri, Fartash and Vasu, Pavan Kumar Anasosalu and Koc, Cem and Shankar, Vaishaal and Toshev, Alexander and Tuzel, Oncel and Pouransari, Hadi},
          journal={arXiv preprint arXiv:2508.20691},
          year={2025}
        }