SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot Paper • 2301.00774 • Published Jan 2, 2023 • 3
ShortGPT: Layers in Large Language Models are More Redundant Than You Expect Paper • 2403.03853 • Published Mar 6, 2024 • 61
LLM-Pruner: On the Structural Pruning of Large Language Models Paper • 2305.11627 • Published May 19, 2023 • 3
A Simple and Effective Pruning Approach for Large Language Models Paper • 2306.11695 • Published Jun 20, 2023 • 3
Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning Paper • 2310.06694 • Published Oct 10, 2023 • 4
Streamlining Redundant Layers to Compress Large Language Models Paper • 2403.19135 • Published Mar 28, 2024 • 1