turboderp committed on
Commit
08704f1
1 Parent(s): acfb5c2

Create README.md

Files changed (1)
  1. README.md +16 -0
README.md ADDED
@@ -0,0 +1,16 @@
+ ---
+ license: llama3.1
+ ---
+ EXL2 quants of [Llama-3.1 70B Instruct](https://huggingface.co/meta-llama/Meta-Llama-3.1-70B-Instruct/tree/main)
+
+ _**This model requires the dev branch of ExLlamaV2 for now. New release coming soon with the necessary changes.**_
+
+ [2.50 bits per weight](https://huggingface.co/turboderp/Llama-3.1-70B-Instruct-exl2/tree/2.5bpw)
+ [3.00 bits per weight](https://huggingface.co/turboderp/Llama-3.1-70B-Instruct-exl2/tree/3.0bpw)
+ [3.50 bits per weight](https://huggingface.co/turboderp/Llama-3.1-70B-Instruct-exl2/tree/3.5bpw)
+ [4.00 bits per weight](https://huggingface.co/turboderp/Llama-3.1-70B-Instruct-exl2/tree/4.0bpw)
+ [4.50 bits per weight](https://huggingface.co/turboderp/Llama-3.1-70B-Instruct-exl2/tree/4.5bpw)
+ [5.00 bits per weight](https://huggingface.co/turboderp/Llama-3.1-70B-Instruct-exl2/tree/5.0bpw)
+ [6.00 bits per weight](https://huggingface.co/turboderp/Llama-3.1-70B-Instruct-exl2/tree/6.0bpw)
+
+ [measurement.json](https://huggingface.co/turboderp/Llama-3.1-70B-Instruct-exl2/blob/main/measurement.json)
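
Each quant in the README lives on its own branch named after its bits per weight (`2.5bpw`, `3.0bpw`, and so on). As a rough aid for choosing one, the sketch below estimates the weight footprint of each branch and picks the largest that fits a memory budget. It is not part of the repository. The parameter count (about 70.6 billion) and the 4 GiB reserve for cache and activations are assumptions, and the helper names `branch_name`, `weight_gib`, and `largest_fitting` are hypothetical. Real file sizes will differ somewhat, so treat the numbers as ballpark figures only.

```python
# Rough sizing sketch for the EXL2 branches listed in the README.
# Assumption: Llama-3.1 70B has roughly 70.6e9 parameters. Actual EXL2 files
# also carry metadata, and not every tensor is stored at the nominal rate.
PARAMS = 70.6e9

# Bits-per-weight values taken from the branch links in the README.
BPW_BRANCHES = [2.5, 3.0, 3.5, 4.0, 4.5, 5.0, 6.0]


def branch_name(bpw: float) -> str:
    """Return the branch name used in the README links, e.g. 4.0 -> '4.0bpw'."""
    return f"{bpw:.1f}bpw"


def weight_gib(bpw: float, params: float = PARAMS) -> float:
    """Estimate weight storage in GiB: parameters * bits per weight / 8 bytes."""
    return params * bpw / 8 / 2**30


def largest_fitting(budget_gib: float, reserve_gib: float = 4.0):
    """Pick the highest-bpw branch whose weights fit in the budget.

    reserve_gib is an assumed allowance for cache and activations;
    returns None when no listed branch fits.
    """
    usable = budget_gib - reserve_gib
    fitting = [b for b in BPW_BRANCHES if weight_gib(b) <= usable]
    return branch_name(max(fitting)) if fitting else None


if __name__ == "__main__":
    for bpw in BPW_BRANCHES:
        print(f"{branch_name(bpw):>7}  ~{weight_gib(bpw):5.1f} GiB")
    print("Suggested branch for 48 GiB:", largest_fitting(48.0))
```

The returned branch name can be passed as the `revision` argument of `huggingface_hub.snapshot_download` together with the repo id `turboderp/Llama-3.1-70B-Instruct-exl2` to fetch only that quant.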