Multimodal Embeddings - a marcusinthesky Collection

Multimodal Embeddings

updated Oct 19, 2024

MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions
Paper • 2403.19651 • Published Mar 28, 2024 • 25

No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance
Paper • 2404.04125 • Published Apr 4, 2024 • 29

Scaling (Down) CLIP: A Comprehensive Analysis of Data, Architecture, and Training Strategies
Paper • 2404.08197 • Published Apr 12, 2024 • 29

Gecko: Versatile Text Embeddings Distilled from Large Language Models
Paper • 2403.20327 • Published Mar 29, 2024 • 48

OpenGVLab/InternVL-14B-224px
Image Feature Extraction • 14B • Updated Dec 9, 2024 • 524 • 35

Alibaba-NLP/gte-large-en-v1.5
Sentence Similarity • 0.4B • Updated Aug 9, 2024 • 3.43M • 227

jinaai/jina-embeddings-v2-base-en
Feature Extraction • 0.1B • Updated Jan 6 • 173k • 728

castorini/repllama-v1.1-mrl-7b-lora-passage
Feature Extraction • 7B • Updated May 12, 2024 • 4 • 5

McGill-NLP/LLM2Vec-Sheared-LLaMA-mntp
Sentence Similarity • Updated May 21, 2024 • 1.75k • 5

BAAI/bge-visualized
Updated Dec 23, 2024 • 66

royokong/e5-v
Image-to-Text • 8B • Updated Oct 31, 2024 • 20.9k • 28

TIGER-Lab/VLM2Vec-Full
Text Generation • 4B • Updated Apr 7 • 44.3k • 28

openbmb/VisRAG-Ret
Feature Extraction • 3B • Updated Nov 4, 2024 • 1.43k • 71