marcusinthesky
's Collections
Multimodal Embeddings
updated
MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions
Paper
•
2403.19651
•
Published
•
22
No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency
Determines Multimodal Model Performance
Paper
•
2404.04125
•
Published
•
27
Scaling (Down) CLIP: A Comprehensive Analysis of Data, Architecture, and
Training Strategies
Paper
•
2404.08197
•
Published
•
27
Gecko: Versatile Text Embeddings Distilled from Large Language Models
Paper
•
2403.20327
•
Published
•
47
OpenGVLab/InternVL-14B-224px
Image Feature Extraction
•
Updated
•
4.42k
•
36
Alibaba-NLP/gte-large-en-v1.5
Sentence Similarity
•
Updated
•
2.49M
•
186
jinaai/jina-embeddings-v2-base-en
Feature Extraction
•
Updated
•
61.3k
•
707
castorini/repllama-v1.1-mrl-7b-lora-passage
Feature Extraction
•
Updated
•
8
•
5
McGill-NLP/LLM2Vec-Sheared-LLaMA-mntp
Sentence Similarity
•
Updated
•
3.26k
•
4
BAAI/bge-visualized
royokong/e5-v
Image-Text-to-Text
•
Updated
•
1.73k
•
18
TIGER-Lab/VLM2Vec-Full
Text Generation
•
Updated
•
22.7k
•
21
openbmb/VisRAG-Ret
Feature Extraction
•
Updated
•
1.54k
•
54