Shortened LLaMA: A Simple Depth Pruning for Large Language Models • Paper 2402.02834 • Published Feb 5, 2024
Automatic Neural Network Pruning that Efficiently Preserves the Model Accuracy • Paper 2111.09635 • Published Nov 18, 2021
On Architectural Compression of Text-to-Image Diffusion Models • Paper 2305.15798 • Published May 25, 2023