morriszms commited on
Commit
609334d
1 Parent(s): e0fdbd8

Upload folder using huggingface_hub

Browse files
.gitattributes CHANGED
@@ -33,3 +33,15 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
 
 
 
 
 
 
 
 
 
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ Medichat-Llama3-8B-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
37
+ Medichat-Llama3-8B-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
38
+ Medichat-Llama3-8B-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
39
+ Medichat-Llama3-8B-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
40
+ Medichat-Llama3-8B-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
41
+ Medichat-Llama3-8B-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
42
+ Medichat-Llama3-8B-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
43
+ Medichat-Llama3-8B-Q5_0.gguf filter=lfs diff=lfs merge=lfs -text
44
+ Medichat-Llama3-8B-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
45
+ Medichat-Llama3-8B-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
46
+ Medichat-Llama3-8B-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
47
+ Medichat-Llama3-8B-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
Medichat-Llama3-8B-Q2_K.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:392c64acf7e824f7135473e3cb97d833b02647efc76045bc5a18ea5006c51234
3
+ size 3179132960
Medichat-Llama3-8B-Q3_K_L.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b5217a378106e9573a306ba6371bd6c4fb663aba25164ccbfc194c61286e0e37
3
+ size 4321957920
Medichat-Llama3-8B-Q3_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4cec3be834df911412a1b1810a3053f108720378e0951b157440352865aa3b23
3
+ size 4018919456
Medichat-Llama3-8B-Q3_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6791c9dad7ee2d0142268dd3ee738058edc239f37dc7d5e8d6669a48fea7cc32
3
+ size 3664500768
Medichat-Llama3-8B-Q4_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9a38a7d3518d25ed14220e57977041a5c3380fd0559332f236aa03c4beaf0959
3
+ size 4661213216
Medichat-Llama3-8B-Q4_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:90fbdeea43dce2d2796952a834ebc270578b5745abe2974f5ee29d9e58f7296e
3
+ size 4920735776
Medichat-Llama3-8B-Q4_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b325bcc5feda3bff24ed3dd1dae99cfc70496b70b7d01e9ea7020b4f01a8c4dd
3
+ size 4692670496
Medichat-Llama3-8B-Q5_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c486d65b6426bf91f8198fd8668c537291704ab56908bc1ba485ed39715a2d4f
3
+ size 5599295520
Medichat-Llama3-8B-Q5_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c0aad069826404201438106c8761e50eb7cbd437456c93044ef54bc4f08282f6
3
+ size 5732988960
Medichat-Llama3-8B-Q5_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:bdc7d3417d577cf8a0f2a6cd2c76dcea13c5de36baca5eb35a031516f1ab91e8
3
+ size 5599295520
Medichat-Llama3-8B-Q6_K.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2b25acaae76a7d91e66d2c6985b637880129bf206ca6cee2cb54aa3ad8b44d1e
3
+ size 6596007968
Medichat-Llama3-8B-Q8_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:20874cc3fd33d44a7d76db95e96e2537ee63250b0300b2f32d4a53e07f6f23bf
3
+ size 8540772384
README.md ADDED
@@ -0,0 +1,191 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model: sethuiyer/Medichat-Llama3-8B
3
+ library_name: transformers
4
+ tags:
5
+ - mergekit
6
+ - merge
7
+ - medical
8
+ - TensorBlock
9
+ - GGUF
10
+ license: other
11
+ datasets:
12
+ - mlabonne/orpo-dpo-mix-40k
13
+ - Open-Orca/SlimOrca-Dedup
14
+ - jondurbin/airoboros-3.2
15
+ - microsoft/orca-math-word-problems-200k
16
+ - m-a-p/Code-Feedback
17
+ - MaziyarPanahi/WizardLM_evol_instruct_V2_196k
18
+ - ruslanmv/ai-medical-chatbot
19
+ language:
20
+ - en
21
+ model-index:
22
+ - name: Medichat-Llama3-8B
23
+ results:
24
+ - task:
25
+ type: text-generation
26
+ name: Text Generation
27
+ dataset:
28
+ name: AI2 Reasoning Challenge (25-Shot)
29
+ type: ai2_arc
30
+ config: ARC-Challenge
31
+ split: test
32
+ args:
33
+ num_few_shot: 25
34
+ metrics:
35
+ - type: acc_norm
36
+ value: 59.13
37
+ name: normalized accuracy
38
+ source:
39
+ url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=sethuiyer/Medichat-Llama3-8B
40
+ name: Open LLM Leaderboard
41
+ - task:
42
+ type: text-generation
43
+ name: Text Generation
44
+ dataset:
45
+ name: HellaSwag (10-Shot)
46
+ type: hellaswag
47
+ split: validation
48
+ args:
49
+ num_few_shot: 10
50
+ metrics:
51
+ - type: acc_norm
52
+ value: 82.9
53
+ name: normalized accuracy
54
+ source:
55
+ url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=sethuiyer/Medichat-Llama3-8B
56
+ name: Open LLM Leaderboard
57
+ - task:
58
+ type: text-generation
59
+ name: Text Generation
60
+ dataset:
61
+ name: MMLU (5-Shot)
62
+ type: cais/mmlu
63
+ config: all
64
+ split: test
65
+ args:
66
+ num_few_shot: 5
67
+ metrics:
68
+ - type: acc
69
+ value: 60.35
70
+ name: accuracy
71
+ source:
72
+ url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=sethuiyer/Medichat-Llama3-8B
73
+ name: Open LLM Leaderboard
74
+ - task:
75
+ type: text-generation
76
+ name: Text Generation
77
+ dataset:
78
+ name: TruthfulQA (0-shot)
79
+ type: truthful_qa
80
+ config: multiple_choice
81
+ split: validation
82
+ args:
83
+ num_few_shot: 0
84
+ metrics:
85
+ - type: mc2
86
+ value: 49.65
87
+ source:
88
+ url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=sethuiyer/Medichat-Llama3-8B
89
+ name: Open LLM Leaderboard
90
+ - task:
91
+ type: text-generation
92
+ name: Text Generation
93
+ dataset:
94
+ name: Winogrande (5-shot)
95
+ type: winogrande
96
+ config: winogrande_xl
97
+ split: validation
98
+ args:
99
+ num_few_shot: 5
100
+ metrics:
101
+ - type: acc
102
+ value: 78.93
103
+ name: accuracy
104
+ source:
105
+ url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=sethuiyer/Medichat-Llama3-8B
106
+ name: Open LLM Leaderboard
107
+ - task:
108
+ type: text-generation
109
+ name: Text Generation
110
+ dataset:
111
+ name: GSM8k (5-shot)
112
+ type: gsm8k
113
+ config: main
114
+ split: test
115
+ args:
116
+ num_few_shot: 5
117
+ metrics:
118
+ - type: acc
119
+ value: 60.35
120
+ name: accuracy
121
+ source:
122
+ url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=sethuiyer/Medichat-Llama3-8B
123
+ name: Open LLM Leaderboard
124
+ ---
125
+
126
+ <div style="width: auto; margin-left: auto; margin-right: auto">
127
+ <img src="https://i.imgur.com/jC7kdl8.jpeg" alt="TensorBlock" style="width: 100%; min-width: 400px; display: block; margin: auto;">
128
+ </div>
129
+ <div style="display: flex; justify-content: space-between; width: 100%;">
130
+ <div style="display: flex; flex-direction: column; align-items: flex-start;">
131
+ <p style="margin-top: 0.5em; margin-bottom: 0em;">
132
+ Feedback and support: TensorBlock's <a href="https://x.com/tensorblock_aoi">Twitter/X</a>, <a href="https://t.me/TensorBlock">Telegram Group</a> and <a href="https://x.com/tensorblock_aoi">Discord server</a>
133
+ </p>
134
+ </div>
135
+ </div>
136
+
137
+ ## sethuiyer/Medichat-Llama3-8B - GGUF
138
+
139
+ This repo contains GGUF format model files for [sethuiyer/Medichat-Llama3-8B](https://huggingface.co/sethuiyer/Medichat-Llama3-8B).
140
+
141
+ The files were quantized using machines provided by [TensorBlock](https://tensorblock.co/), and they are compatible with llama.cpp as of [commit b4011](https://github.com/ggerganov/llama.cpp/commit/a6744e43e80f4be6398fc7733a01642c846dce1d).
142
+
143
+ ## Prompt template
144
+
145
+ ```
146
+ <|im_start|>system
147
+ {system_prompt}<|im_end|>
148
+ <|im_start|>user
149
+ {prompt}<|im_end|>
150
+ <|im_start|>assistant
151
+ ```
152
+
153
+ ## Model file specification
154
+
155
+ | Filename | Quant type | File Size | Description |
156
+ | -------- | ---------- | --------- | ----------- |
157
+ | [Medichat-Llama3-8B-Q2_K.gguf](https://huggingface.co/tensorblock/Medichat-Llama3-8B-GGUF/tree/main/Medichat-Llama3-8B-Q2_K.gguf) | Q2_K | 2.961 GB | smallest, significant quality loss - not recommended for most purposes |
158
+ | [Medichat-Llama3-8B-Q3_K_S.gguf](https://huggingface.co/tensorblock/Medichat-Llama3-8B-GGUF/tree/main/Medichat-Llama3-8B-Q3_K_S.gguf) | Q3_K_S | 3.413 GB | very small, high quality loss |
159
+ | [Medichat-Llama3-8B-Q3_K_M.gguf](https://huggingface.co/tensorblock/Medichat-Llama3-8B-GGUF/tree/main/Medichat-Llama3-8B-Q3_K_M.gguf) | Q3_K_M | 3.743 GB | very small, high quality loss |
160
+ | [Medichat-Llama3-8B-Q3_K_L.gguf](https://huggingface.co/tensorblock/Medichat-Llama3-8B-GGUF/tree/main/Medichat-Llama3-8B-Q3_K_L.gguf) | Q3_K_L | 4.025 GB | small, substantial quality loss |
161
+ | [Medichat-Llama3-8B-Q4_0.gguf](https://huggingface.co/tensorblock/Medichat-Llama3-8B-GGUF/tree/main/Medichat-Llama3-8B-Q4_0.gguf) | Q4_0 | 4.341 GB | legacy; small, very high quality loss - prefer using Q3_K_M |
162
+ | [Medichat-Llama3-8B-Q4_K_S.gguf](https://huggingface.co/tensorblock/Medichat-Llama3-8B-GGUF/tree/main/Medichat-Llama3-8B-Q4_K_S.gguf) | Q4_K_S | 4.370 GB | small, greater quality loss |
163
+ | [Medichat-Llama3-8B-Q4_K_M.gguf](https://huggingface.co/tensorblock/Medichat-Llama3-8B-GGUF/tree/main/Medichat-Llama3-8B-Q4_K_M.gguf) | Q4_K_M | 4.583 GB | medium, balanced quality - recommended |
164
+ | [Medichat-Llama3-8B-Q5_0.gguf](https://huggingface.co/tensorblock/Medichat-Llama3-8B-GGUF/tree/main/Medichat-Llama3-8B-Q5_0.gguf) | Q5_0 | 5.215 GB | legacy; medium, balanced quality - prefer using Q4_K_M |
165
+ | [Medichat-Llama3-8B-Q5_K_S.gguf](https://huggingface.co/tensorblock/Medichat-Llama3-8B-GGUF/tree/main/Medichat-Llama3-8B-Q5_K_S.gguf) | Q5_K_S | 5.215 GB | large, low quality loss - recommended |
166
+ | [Medichat-Llama3-8B-Q5_K_M.gguf](https://huggingface.co/tensorblock/Medichat-Llama3-8B-GGUF/tree/main/Medichat-Llama3-8B-Q5_K_M.gguf) | Q5_K_M | 5.339 GB | large, very low quality loss - recommended |
167
+ | [Medichat-Llama3-8B-Q6_K.gguf](https://huggingface.co/tensorblock/Medichat-Llama3-8B-GGUF/tree/main/Medichat-Llama3-8B-Q6_K.gguf) | Q6_K | 6.143 GB | very large, extremely low quality loss |
168
+ | [Medichat-Llama3-8B-Q8_0.gguf](https://huggingface.co/tensorblock/Medichat-Llama3-8B-GGUF/tree/main/Medichat-Llama3-8B-Q8_0.gguf) | Q8_0 | 7.954 GB | very large, extremely low quality loss - not recommended |
169
+
170
+
171
+ ## Downloading instruction
172
+
173
+ ### Command line
174
+
175
+ Firstly, install Huggingface Client
176
+
177
+ ```shell
178
+ pip install -U "huggingface_hub[cli]"
179
+ ```
180
+
181
+ Then, downoad the individual model file the a local directory
182
+
183
+ ```shell
184
+ huggingface-cli download tensorblock/Medichat-Llama3-8B-GGUF --include "Medichat-Llama3-8B-Q2_K.gguf" --local-dir MY_LOCAL_DIR
185
+ ```
186
+
187
+ If you wanna download multiple model files with a pattern (e.g., `*Q4_K*gguf`), you can try:
188
+
189
+ ```shell
190
+ huggingface-cli download tensorblock/Medichat-Llama3-8B-GGUF --local-dir MY_LOCAL_DIR --local-dir-use-symlinks False --include='*Q4_K*gguf'
191
+ ```