Update README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,24 @@
|
|
1 |
-
---
|
2 |
-
license: unknown
|
3 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
license: unknown
|
3 |
+
library_name: transformers
|
4 |
+
pipeline_tag: text-generation
|
5 |
+
---
|
6 |
+
|
7 |
+
# Deepseek-V2-Chat-GGUF
|
8 |
+
|
9 |
+
Quantizised from [https://huggingface.co/deepseek-ai/DeepSeek-V2-Chat](https://huggingface.co/deepseek-ai/DeepSeek-V2-Chat)
|
10 |
+
|
11 |
+
Using llama.cpp fork: [https://github.com/fairydreaming/llama.cpp/tree/deepseek-v2](https://github.com/fairydreaming/llama.cpp/tree/deepseek-v2)
|
12 |
+
|
13 |
+
# Warning: This will not work unless you compile llama.cpp from the repo provided!
|
14 |
+
|
15 |
+
# How to use:
|
16 |
+
|
17 |
+
- Find the relevant directory
|
18 |
+
- Download all files
|
19 |
+
- Run merge.py
|
20 |
+
- Merged GGUF should appear
|
21 |
+
|
22 |
+
# Quants:
|
23 |
+
- bf16
|
24 |
+
- q8_0
|