arxiv:2408.14547
Nicholas Moratelli
NicholasMoratelli
AI & ML interests
Multimodal Large Language Models - Vision and Language - Foundation Models - GenAI - Compositional AI
Recent Activity
Authored a paper about 1 month ago:
Revisiting Image Captioning Training Paradigm via Direct CLIP-based Optimization
Organizations
Papers
1
Models
None public yet
Datasets
None public yet