Quantifying Generalization Complexity for Large Language Models • Paper • 2410.01769 • Published 18 days ago • 12
Training Task Experts through Retrieval Based Distillation • Paper • 2407.05463 • Published Jul 7 • 6
Self-MoE: Towards Compositional Large Language Models with Self-Specialized Experts • Paper • 2406.12034 • Published Jun 17 • 13
DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models • Paper • 2309.03883 • Published Sep 7, 2023 • 33
Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning • Paper • 2309.10814 • Published Sep 19, 2023 • 3