Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
aimagelab 's Collections
DICE
ReT-2
RAID
ReflectiVA
ReT
Safe-CLIP
LLaVA-MORE

ReT-2

updated Sep 12

Models and data for the paper "Recurrence Meets Transformers for Universal Multimodal Retrieval" (arXiv 2509.08897)

Upvote
1

  • aimagelab/ReT2-M2KR-CLIP-ViT-B

    Visual Document Retrieval • 0.2B • Updated Sep 12 • 9 • 1

  • aimagelab/ReT2-M2KR-CLIP-ViT-L

    Visual Document Retrieval • 0.4B • Updated Sep 12 • 2

  • aimagelab/ReT2-M2KR-SigLIP2-ViT-L

    Visual Document Retrieval • 0.9B • Updated Sep 12 • 2 • 1

  • aimagelab/ReT2-M2KR-ColBERT-CLIP-ViT-L

    Visual Document Retrieval • 0.4B • Updated Sep 12 • 3

  • aimagelab/ReT2-M2KR-ColBERT-SigLIP2-ViT-L

    Visual Document Retrieval • 0.4B • Updated Sep 13 • 13

  • aimagelab/ReT2-M2KR-OpenCLIP-ViT-H

    Visual Document Retrieval • 1B • Updated Sep 12 • 2

  • aimagelab/ReT2-MBEIR-CLIP-ViT-L

    Visual Document Retrieval • 0.4B • Updated Sep 12 • 2

  • aimagelab/ReT2-MBEIR-SigLIP2-ViT-L

    Visual Document Retrieval • 0.9B • Updated Sep 12 • 2

  • aimagelab/ReT-M2KR

    Preview • Updated Sep 12 • 606 • 2

  • Recurrence Meets Transformers for Universal Multimodal Retrieval

    Paper • 2509.08897 • Published Sep 10
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs