Efficient Adversarial Training in LLMs with Continuous Attacks • Paper • 2405.15589 • Published May 24, 2024
Does Pre-training Induce Systematic Inference? How Masked Language Models Acquire Commonsense Knowledge • Paper • 2112.08583 • Published Dec 16, 2021
Multi-Head Adapter Routing for Cross-Task Generalization • Paper • 2211.03831 • Published Nov 7, 2022
Deep Language Networks: Joint Prompt Training of Stacked LLMs using Variational Inference • Paper • 2306.12509 • Published Jun 21, 2023
Towards Modular LLMs by Building and Reusing a Library of LoRAs • Paper • 2405.11157 • Published May 18, 2024