Dracones committed on
Commit
66452d4
1 Parent(s): 6aebc18

Upload README.md with huggingface_hub

  1. README.md +48 -0
README.md ADDED
@@ -0,0 +1,48 @@
+ ---
+ library_name: transformers
+ language:
+ - en
+ - fr
+ - it
+ - pt
+ - hi
+ - es
+ - th
+ - de
+ base_model:
+ - meta-llama/Llama-3.3-70B-Instruct
+ tags:
+ - exl2
+ - facebook
+ - meta
+ - pytorch
+ - llama
+ - llama-3
+ license: llama3.3
+ ---
+
+ # Llama-3.3-70B-Instruct - EXL2 2.75bpw
+
+ This is a 2.75bpw EXL2 quant of [meta-llama/Llama-3.3-70B-Instruct](https://huggingface.co/meta-llama/Llama-3.3-70B-Instruct).
+
+ Details about the model can be found on the model page linked above.
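
As a rough illustration of what 2.75 bits per weight (bpw) means for download size, the sketch below multiplies a parameter count by the bit rate. The figure of 70 billion weights is an assumption taken from the model name, and the helper `estimated_weight_size_gib` is hypothetical. The estimate covers weights only: it ignores cache and activation memory, file metadata, and any layers stored at a different precision.

```python
def estimated_weight_size_gib(num_params: float, bits_per_weight: float) -> float:
    """Approximate size of the quantized weights alone, in GiB."""
    total_bits = num_params * bits_per_weight
    return total_bits / 8 / 1024**3  # bits -> bytes -> GiB


# Assumption: about 70 billion weights; see the base model page for the exact count.
size = estimated_weight_size_gib(70e9, 2.75)
print(f"Estimated weights: {size:.1f} GiB")
```

Actual memory use when loading the model will be higher than this estimate, since the cache and runtime buffers are not included.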
+
+ ## EXL2 Version
+
+ These quants were made with exllamav2 version 0.2.4. Quants made with this version may not load with older versions of the exllamav2 library.
+
+ If you have problems loading these models, please update Text Generation WebUI to the latest version.
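
If you load the model from your own Python environment, a quick way to see whether the installed library is at least 0.2.4 is sketched below. It assumes the library is installed under the distribution name `exllamav2` and that versions are plain dotted numbers; the helpers `parse_version`, `is_new_enough`, and `check_installed` are hypothetical names for this example. That newer releases can load these quants is also an assumption, not something this README guarantees.

```python
from importlib import metadata

# Version the quants were made with, per this README.
MIN_EXLLAMAV2 = (0, 2, 4)


def parse_version(text: str) -> tuple[int, ...]:
    """Parse the leading numeric parts of a version such as '0.2.4' or '0.2.4+cu121'."""
    parts = []
    for piece in text.split("+")[0].split("."):
        if not piece.isdigit():
            break  # stop at the first non-numeric component
        parts.append(int(piece))
    return tuple(parts)


def is_new_enough(installed: str, minimum: tuple[int, ...] = MIN_EXLLAMAV2) -> bool:
    """True if the installed version is at least the minimum."""
    return parse_version(installed) >= minimum


def check_installed(package: str = "exllamav2") -> str:
    """Describe whether the installed package meets the minimum version."""
    try:
        installed = metadata.version(package)
    except metadata.PackageNotFoundError:
        return f"{package} is not installed in this environment"
    if is_new_enough(installed):
        return f"{package} {installed} is at least 0.2.4"
    return f"{package} {installed} is older than 0.2.4; consider upgrading"


if __name__ == "__main__":
    print(check_installed())
```

Text Generation WebUI bundles its own copy of the library, so run the check inside the environment that WebUI uses.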
+
+ ## Perplexity Scoring
+
+ Below are the perplexity scores for the EXL2 models. A lower score is better.
+
+ | Quant Level (bpw) | Perplexity Score |
+ |-------------------|------------------|
+ | 5.0 | 4.7932 |
+ | 4.5 | 4.8894 |
+ | 4.0 | 5.0079 |
+ | 3.5 | 5.3992 |
+ | 3.0 | 7.2686 |
+ | 2.5 | 10.5543 |
+ | 2.25 | 8.8764 |
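
For readers unfamiliar with the metric, perplexity is commonly defined as the exponential of the mean negative log-likelihood per token. The sketch below shows that standard definition on made-up log-probabilities. It is not the script used to produce the table, and this README does not state the evaluation dataset or settings; the function name `perplexity` and the sample values are illustrative only.

```python
import math


def perplexity(token_logprobs: list[float]) -> float:
    """exp of the mean negative log-likelihood (natural log) per token."""
    if not token_logprobs:
        raise ValueError("need at least one token log-probability")
    mean_nll = -sum(token_logprobs) / len(token_logprobs)
    return math.exp(mean_nll)


# Made-up log-probabilities for illustration; not taken from any model.
confident = [math.log(0.5)] * 4  # every token predicted with probability 0.5
unsure = [math.log(0.1)] * 4     # every token predicted with probability 0.1

print(perplexity(confident))
print(perplexity(unsure))
```

A model that assigns higher probability to the correct next tokens gets a lower perplexity, which is why lower scores in the table are better.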