clip_lora_vision_encoder & visual_projevtor with youtube dataset. c4b1330 verified Soran commited on Feb 20