Efficient Methods for Natural Language Processing: A Survey Paper • 2209.00099 • Published Aug 31, 2022 • 1
BTR: Binary Token Representations for Efficient Retrieval Augmented Language Models Paper • 2310.01329 • Published Oct 2, 2023
APT: Adaptive Pruning and Tuning Pretrained Language Models for Efficient Training and Inference Paper • 2401.12200 • Published Jan 22, 2024 • 1
OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework Paper • 2404.14619 • Published Apr 22, 2024 • 126