InternViT-6B + QLLaMA, can be used for image-text retrieval like CLIP
4
#5 opened 2 months ago
by
vitvit
Fix incorrect image embedding when running with a single GPU and 24GB VRAM
1
#3 opened 9 months ago
by
xdedss
