Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models Paper β’ 2405.15574 β’ Published May 24 β’ 52
A Careful Examination of Large Language Model Performance on Grade School Arithmetic Paper β’ 2405.00332 β’ Published May 1 β’ 30
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone Paper β’ 2404.14219 β’ Published Apr 22 β’ 240
OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework Paper β’ 2404.14619 β’ Published Apr 22 β’ 124
PLLaVA : Parameter-free LLaVA Extension from Images to Videos for Video Dense Captioning Paper β’ 2404.16994 β’ Published Apr 25 β’ 33
Mixture-of-Depths: Dynamically allocating compute in transformer-based language models Paper β’ 2404.02258 β’ Published Apr 2 β’ 102
Proactive Detection of Voice Cloning with Localized Watermarking Paper β’ 2401.17264 β’ Published Jan 30 β’ 15
High-Quality Image Restoration Following Human Instructions Paper β’ 2401.16468 β’ Published Jan 29 β’ 11
Q-Transformer: Scalable Offline Reinforcement Learning via Autoregressive Q-Functions Paper β’ 2309.10150 β’ Published Sep 18, 2023 β’ 23