Update README.md
Browse files
README.md
CHANGED
@@ -2,6 +2,10 @@
|
|
2 |
inference: false
|
3 |
license: apache-2.0
|
4 |
---
|
|
|
|
|
|
|
|
|
5 |
# Vicuna Model Card
|
6 |
|
7 |
## Model Details
|
|
|
2 |
inference: false
|
3 |
license: apache-2.0
|
4 |
---
|
5 |
+
# Overview
|
6 |
+
This model has been pruned to 20% sparsity using the [Wanda pruning method](https://arxiv.org/abs/2306.11695) on attention layers. This method requires no retraining or weight updates and still achieves competitive performance. A link to the base model can be found [here](https://huggingface.co/lmsys/vicuna-7b-v1.3).
|
7 |
+
|
8 |
+
|
9 |
# Vicuna Model Card
|
10 |
|
11 |
## Model Details
|