marcusinthesky
's Collections
Multimodal Embeddings
updated
MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions
Paper
•
2403.19651
•
Published
•
25
No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency
Determines Multimodal Model Performance
Paper
•
2404.04125
•
Published
•
29
Scaling (Down) CLIP: A Comprehensive Analysis of Data, Architecture, and
Training Strategies
Paper
•
2404.08197
•
Published
•
29
Gecko: Versatile Text Embeddings Distilled from Large Language Models
Paper
•
2403.20327
•
Published
•
48
OpenGVLab/InternVL-14B-224px
Image Feature Extraction
•
14B
•
Updated
•
524
•
35
Alibaba-NLP/gte-large-en-v1.5
Sentence Similarity
•
0.4B
•
Updated
•
3.43M
•
227
jinaai/jina-embeddings-v2-base-en
Feature Extraction
•
0.1B
•
Updated
•
173k
•
728
castorini/repllama-v1.1-mrl-7b-lora-passage
Feature Extraction
•
7B
•
Updated
•
4
•
5
McGill-NLP/LLM2Vec-Sheared-LLaMA-mntp
Sentence Similarity
•
Updated
•
1.75k
•
5
BAAI/bge-visualized
Updated
•
66
royokong/e5-v
Image-to-Text
•
8B
•
Updated
•
20.9k
•
28
TIGER-Lab/VLM2Vec-Full
Text Generation
•
4B
•
Updated
•
44.3k
•
28
openbmb/VisRAG-Ret
Feature Extraction
•
3B
•
Updated
•
1.43k
•
71