Dracones committed on
Commit
171fe4f
1 Parent(s): 15b2d32

Upload README.md with huggingface_hub

  1. README.md +72 -1
README.md CHANGED
@@ -16,7 +16,11 @@ This is a 5.0bpw EXL2 quant of [sophosympatheia/Midnight-Miqu-70B-v1.5](https://
16
 
17
  Details about the model and the merge info can be found at the above model page.
18
 
19
- I have not extensively tested this quant/model other than ensuring I could load it and chat with it.
20
 
21
  ## Tavern Card
22
 
@@ -31,6 +35,73 @@ _In one hand, he holds a smoldering pipe filled with exotic herbs, casting a mys
31
  _The room is cluttered with tomes and scrolls, floating in midair as if held by invisible hands, creating a maelstrom of knowledge. Behind him, a crystal ball reflects swirling images of distant lands and times, while a cauldron bubbles with unknown concoctions on the hearth of an ancient fireplace. The scene exudes an air of enigma and might, leaving you both awestruck and slightly intimidated in the presence of this legendary figure from Thaylonia._
32
 
33
 
34
  ## Quant Details
35
 
36
  This is the script used for quantization.
 
16
 
17
  Details about the model and the merge info can be found at the above model page.
18
 
19
+ ## Prompt Templates
20
+
21
+ Please see [sophosympatheia/Midnight-Miqu-70B-v1.5](https://huggingface.co/sophosympatheia/Midnight-Miqu-70B-v1.5) for Silly Tavern presets and templates.
22
+
23
+ Further details on prompting this model will also pop up under the [model discussions](https://huggingface.co/sophosympatheia/Midnight-Miqu-70B-v1.0/discussions)
24
 
25
  ## Tavern Card
26
 
 
35
  _The room is cluttered with tomes and scrolls, floating in midair as if held by invisible hands, creating a maelstrom of knowledge. Behind him, a crystal ball reflects swirling images of distant lands and times, while a cauldron bubbles with unknown concoctions on the hearth of an ancient fireplace. The scene exudes an air of enigma and might, leaving you both awestruck and slightly intimidated in the presence of this legendary figure from Thaylonia._
36
 
37
 
38
+ ## Perplexity Scoring
39
+
40
+ Below are the perplexity scores for the EXL2 models. A lower score is better.
41
+
+ | Quant Level | Perplexity Score |
+ |-------------|------------------|
+ | 5.0         | 5.1226           |
+ | 4.5         | 5.1590           |
+ | 4.0         | 5.1772           |
+ | 3.5         | 5.3030           |
+ | 3.0         | 5.4156           |
+ | 2.75        | 5.8717           |
+ | 2.5         | 5.7236           |
+ | 2.25        | 6.4102           |
52
+
53
+
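The table is ordered from highest to lowest bits per weight, so perplexity would normally be expected to rise from row to row. A minimal sketch for spotting rows that break that pattern is below; `find_inversions` is a hypothetical helper name, not part of the commit, and the input is the table's values retyped as plain `bpw perplexity` pairs.

```bash
#!/bin/bash

# Hypothetical helper: read "bpw perplexity" pairs (highest bpw first) from stdin
# and print any bpw whose perplexity is lower than the row directly above it.
find_inversions() {
    awk 'NR > 1 && ($2 + 0) < prev { print $1 } { prev = $2 + 0 }'
}

# Values copied from the Perplexity Scoring table.
find_inversions <<'EOF'
5.0 5.1226
4.5 5.1590
4.0 5.1772
3.5 5.3030
3.0 5.4156
2.75 5.8717
2.5 5.7236
2.25 6.4102
EOF
```

On these values it prints `2.5`, since the 2.5 bpw quant (5.7236) scores lower than the 2.75 bpw quant (5.8717).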
54
+ ## EQ Bench
55
+
56
+ Here are the EQ Bench scores for the EXL2 quants using Alpaca, ChatML, Mistral, Vicuna-v1.1 and Vicuna-v0 prompt templates. A higher score is better.
57
+
+ | Quant Size | Alpaca | ChatML | Mistral | Vicuna-v0 | Vicuna-v1.1 |
+ |------------|--------|--------|---------|-----------|-------------|
+ | 5.0        | 77.38  | 76.25  | 77.67   | 78.83     | 77.86       |
+ | 4.5        | 75.45  | 74.76  | 76.06   | 76.39     | 76.28       |
+ | 4.0        | 77.99  | 75.25  | 77.18   | 76.84     | 76.08       |
+ | 3.5        | 73.47  | 71.83  | 72.6    | 72.0      | 74.77       |
+ | 3.0        | 71.46  | 70.33  | 71.06   | 72.75     | 72.21       |
+ | 2.75       | 76.41  | 72.76  | 75.99   | 76.06     | 77.19       |
+ | 2.5        | 74.61  | 74.78  | 75.58   | 74.2      | 75.55       |
+ | 2.25       | 72.76  | 71.28  | 72.89   | 72.81     | 71.91       |
68
+
69
+
70
+ ### Perplexity Script
71
+
72
+ This was the script used for perplexity testing.
73
+
+ ```bash
+ #!/bin/bash
+
+ # Activate the conda environment
+ source ~/miniconda3/etc/profile.d/conda.sh
+ conda activate exllamav2
+
+ export CUDA_HOME=/home/mmealman/miniconda3/envs/exllamav2
+ export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:$CUDA_HOME/lib
+ export PATH=$PATH:$CUDA_HOME/bin
+
+ # Set the model name and bit size
+ MODEL_NAME="miqu-1-70b-sf"
+ BIT_PRECISIONS=(5.0 4.5 4.0 3.5 3.0 2.75 2.5 2.25)
+
+ # Print the markdown table header
+ echo "| Quant Level | Perplexity Score |"
+ echo "|-------------|------------------|"
+
+ for BIT_PRECISION in "${BIT_PRECISIONS[@]}"
+ do
+     MODEL_DIR="models/${MODEL_NAME}_exl2_${BIT_PRECISION}bpw"
+     if [ -d "$MODEL_DIR" ]; then
+         output=$(python test_inference.py -m "$MODEL_DIR" -gs 22,24 -ed data/wikitext/wikitext-2-v1.parquet)
+         score=$(echo "$output" | grep -oP 'Evaluation perplexity: \K[\d.]+')
+         echo "| $BIT_PRECISION | $score |"
+     fi
+ done
+ ```
103
+
104
+
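The score extraction in the script relies on GNU grep's `-o` and `-P` options: `-o` prints only the matched text, and the PCRE `\K` escape drops the `Evaluation perplexity: ` label from the match so that only the number remains. A minimal sketch of that one step follows; `extract_perplexity` is a hypothetical wrapper, and the sample line is an assumption inferred from the pattern, not verified `test_inference.py` output.

```bash
#!/bin/bash

# Hypothetical wrapper around the grep call used in the perplexity script.
extract_perplexity() {
    # -o: print only the matched part; -P: PCRE, so \K discards the label before it.
    grep -oP 'Evaluation perplexity: \K[\d.]+' <<< "$1"
}

# Assumed sample line; the real output format may differ.
sample=' -- Evaluation perplexity: 5.1226'
extract_perplexity "$sample"
```

If the pattern does not match, grep prints nothing, so the script would emit a table row with an empty score column instead of failing.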
105
  ## Quant Details
106
 
107
  This is the script used for quantization.