Knowledge Composition using Task Vectors with Learned Anisotropic Scaling Paper • 2407.02880 • Published Jul 3, 2024 • 12 • 3
Instruction Pre-Training: Language Models are Supervised Multitask Learners Paper • 2406.14491 • Published Jun 20, 2024 • 87 • 25