Shotluck Holmes: A Family of Efficient Small-Scale Large Language Vision Models For Video Captioning and Summarization Paper • 2405.20648 • Published May 31, 2024
The EarlyBird Gets the WORM: Heuristically Accelerating EarlyBird Convergence Paper • 2406.11872 • Published May 31, 2024
GeorgiaTech/0.0005_llama_nodpo_3iters_bs128_531lr_oldtrl_iter_3 Text Generation • Updated May 13, 2024 • 5
GeorgiaTech/0.0005_zephyr_withdpo_5551_4iters_bs256_newtrl_iter_3 Text Generation • Updated May 12, 2024 • 6
GeorgiaTech/0.0005_llama_nodpo_3iters_bs128_531lr_oldtrl_iter_2 Text Generation • Updated May 12, 2024 • 59
GeorgiaTech/0.0005_llama_nodpo_3iters_bs128_531lr_oldtrl_iter_1 Text Generation • Updated May 12, 2024 • 40
Cross-Modal Attribute Insertions for Assessing the Robustness of Vision-and-Language Learning Paper • 2306.11065 • Published Jun 19, 2023 • 1
MM-Soc: Benchmarking Multimodal Large Language Models in Social Media Platforms Paper • 2402.14154 • Published Feb 21, 2024 • 2
Mysterious Projections: Multimodal LLMs Gain Domain-Specific Visual Capabilities Without Richer Cross-Modal Projections Paper • 2402.16832 • Published Feb 26, 2024 • 1
Overcoming Language Disparity in Online Content Classification with Multimodal Learning Paper • 2205.09744 • Published May 19, 2022 • 1
Characterizing, Detecting, and Predicting Online Ban Evasion Paper • 2202.05257 • Published Feb 10, 2022
Robustness of Fusion-based Multimodal Classifiers to Cross-Modal Content Dilutions Paper • 2211.02646 • Published Nov 4, 2022
Adversarial Robustness of Prompt-based Few-Shot Learning for Natural Language Understanding Paper • 2306.11066 • Published Jun 19, 2023