Shortened LLaMA: A Simple Depth Pruning for Large Language Models • Paper • 2402.02834 • Published Feb 5
Efficient Large Language Model • Collection • Shortened LLMs from Depth Pruning; https://github.com/Nota-NetsPresso/shortened-llm • 10 items • Updated 11 days ago
Efficient Stable Diffusion • Collection • Block-removed Knowledge-distilled SD models; https://github.com/Nota-NetsPresso/BK-SDM • 9 items • Updated 4 days ago