Token-Efficient Long Video Understanding for Multimodal LLMs Paper • 2503.04130 • Published 6 days ago • 72
Training-Free Bayesianization for Low-Rank Adapters of Large Language Models Paper • 2412.05723 • Published Dec 7, 2024 • 2
DICE: Discrete Inversion Enabling Controllable Editing for Multinomial Diffusion and Masked Generative Models Paper • 2410.08207 • Published Oct 10, 2024 • 19
Probabilistic Conceptual Explainers: Trustworthy Conceptual Explanations for Vision Foundation Models Paper • 2406.12649 • Published Jun 18, 2024 • 16
Multimodal Needle in a Haystack: Benchmarking Long-Context Capability of Multimodal Large Language Models Paper • 2406.11230 • Published Jun 17, 2024 • 34
Multimodal Needle in a Haystack: Benchmarking Long-Context Capability of Multimodal Large Language Models Paper • 2406.11230 • Published Jun 17, 2024 • 34