fbaldassarri
commited on
Commit
•
43551de
1
Parent(s):
5492af4
Update README
Browse files
README.md
CHANGED
@@ -22,7 +22,13 @@ This model has been quantized in INT4, group-size 128, and optimized for inferen
|
|
22 |
This model has been quantized using Intel [auto-round](https://github.com/intel/auto-round), based on [SignRound technique](https://arxiv.org/pdf/2309.05516v4).
|
23 |
|
24 |
```
|
25 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
26 |
--model_name ./models/sapienzanlp_modello-italia-9b \
|
27 |
--device 0 \
|
28 |
--group_size 128 \
|
|
|
22 |
This model has been quantized using Intel [auto-round](https://github.com/intel/auto-round), based on [SignRound technique](https://arxiv.org/pdf/2309.05516v4).
|
23 |
|
24 |
```
|
25 |
+
git clone https://github.com/fbaldassarri/model-conversion.git
|
26 |
+
```
|
27 |
+
|
28 |
+
Then,
|
29 |
+
|
30 |
+
```
|
31 |
+
python3 main.py \
|
32 |
--model_name ./models/sapienzanlp_modello-italia-9b \
|
33 |
--device 0 \
|
34 |
--group_size 128 \
|