Create README.md
Browse files
README.md
ADDED
@@ -0,0 +1,121 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
base_model: [deepseek-ai/DeepSeek-V2-Chat-0628]
|
3 |
+
---
|
4 |
+
|
5 |
+
#### 🚀 Custom quantizations of DeepSeek-V2-Chat-0628 supercharged for CPU inference! 🖥️
|
6 |
+
|
7 |
+
### 🧠 This IQ4XM version uses GGML TYPE IQ_4_XS 4bit in combination with q8_0 bit for blazing fast performance with minimal loss, leveraging int8 optimizations on most newer server CPUs.
|
8 |
+
### 🛠️ While it required some custom code wizardry, it's fully compatible with standard llama.cpp from GitHub or just search for nisten in lmstudio.
|
9 |
+
|
10 |
+
>[!TIP]
|
11 |
+
>🔥 The following 4-bit version is my personal go-to, delivering jaw-dropping performance on ARM cores.
|
12 |
+
>
|
13 |
+
>📁 No need for file concatenation - just point llama-cli at the first file and watch the magic happen!
|
14 |
+
>
|
15 |
+
>💻 Ready to delve in baby? Here's your command-line spell for interactive mode (prompt.txt is optional, but recommended for maximum sorcery):
|
16 |
+
>```bash
|
17 |
+
>./llama-cli --temp 0.4 -m deepseek_0628_cpu_optimized_iq4xm-00001-of-00004.gguf -c 32000 -co -cnv -i -f prompt.txt
|
18 |
+
>```
|
19 |
+
|
20 |
+
```verilog
|
21 |
+
deepseek_0628_cpu_optimized_iq4xm-00001-of-00004.gguf
|
22 |
+
deepseek_0628_cpu_optimized_iq4xm-00002-of-00004.gguf
|
23 |
+
deepseek_0628_cpu_optimized_iq4xm-00003-of-00004.gguf
|
24 |
+
deepseek_0628_cpu_optimized_iq4xm-00004-of-00004.gguf
|
25 |
+
```
|
26 |
+
|
27 |
+
>[!TIP]
|
28 |
+
>### 🚄 Want to download faster than a caffeinated thirsty llama? Here's how:
|
29 |
+
>
|
30 |
+
>🐧 On Linux: `sudo apt install -y aria2`
|
31 |
+
>🍎 On Mac: `brew install aria2`
|
32 |
+
>
|
33 |
+
```bash
|
34 |
+
sudo apt install -y aria2
|
35 |
+
```
|
36 |
+
|
37 |
+
```bash
|
38 |
+
# 🚀 For the turbocharged 4-bit IQ4XM version
|
39 |
+
aria2c -x 8 -o deepseek_0628_cpu_optimized_iq4xm-00001-of-00004.gguf \
|
40 |
+
https://huggingface.co/nisten/deepseek-0628-gguf/resolve/main/deepseek_0628_cpu_optimized_iq4xm-00001-of-00004.gguf
|
41 |
+
|
42 |
+
aria2c -x 8 -o deepseek_0628_cpu_optimized_iq4xm-00002-of-00004.gguf \
|
43 |
+
https://huggingface.co/nisten/deepseek-0628-gguf/resolve/main/deepseek_0628_cpu_optimized_iq4xm-00002-of-00004.gguf
|
44 |
+
|
45 |
+
aria2c -x 8 -o deepseek_0628_cpu_optimized_iq4xm-00003-of-00004.gguf \
|
46 |
+
https://huggingface.co/nisten/deepseek-0628-gguf/resolve/main/deepseek_0628_cpu_optimized_iq4xm-00003-of-00004.gguf
|
47 |
+
|
48 |
+
aria2c -x 8 -o deepseek_0628_cpu_optimized_iq4xm-00004-of-00004.gguf \
|
49 |
+
https://huggingface.co/nisten/deepseek-0628-gguf/resolve/main/deepseek_0628_cpu_optimized_iq4xm-00004-of-00004.gguf
|
50 |
+
```
|
51 |
+
```bash
|
52 |
+
# 🏋️ For the nearly lossless Q8_0 version
|
53 |
+
aria2c -x 8 -o deepseek-0628-q8_0-00001-of-00006.gguf \
|
54 |
+
https://huggingface.co/nisten/deepseek-0628-gguf/resolve/main/deepseek-0628-q8_0-00001-of-00006.gguf
|
55 |
+
|
56 |
+
aria2c -x 8 -o deepseek-0628-q8_0-00002-of-00006.gguf \
|
57 |
+
https://huggingface.co/nisten/deepseek-0628-gguf/resolve/main/deepseek-0628-q8_0-00002-of-00006.gguf
|
58 |
+
|
59 |
+
aria2c -x 8 -o deepseek-0628-q8_0-00003-of-00006.gguf \
|
60 |
+
https://huggingface.co/nisten/deepseek-0628-gguf/resolve/main/deepseek-0628-q8_0-00003-of-00006.gguf
|
61 |
+
|
62 |
+
aria2c -x 8 -o deepseek-0628-q8_0-00004-of-00006.gguf \
|
63 |
+
https://huggingface.co/nisten/deepseek-0628-gguf/resolve/main/deepseek-0628-q8_0-00004-of-00006.gguf
|
64 |
+
|
65 |
+
aria2c -x 8 -o deepseek-0628-q8_0-00005-of-00006.gguf \
|
66 |
+
https://huggingface.co/nisten/deepseek-0628-gguf/resolve/main/deepseek-0628-q8_0-00005-of-00006.gguf
|
67 |
+
|
68 |
+
aria2c -x 8 -o deepseek-0628-q8_0-00006-of-00006.gguf \
|
69 |
+
https://huggingface.co/nisten/deepseek-0628-gguf/resolve/main/deepseek-0628-q8_0-00006-of-00006.gguf
|
70 |
+
```
|
71 |
+
```bash
|
72 |
+
# 🧠 For the full-brain BF16 version
|
73 |
+
aria2c -x 8 -o deepseek-0628-bf16-00001-of-00011.gguf \
|
74 |
+
https://huggingface.co/nisten/deepseek-0628-gguf/resolve/main/deepseek-0628-bf16-00001-of-00011.gguf
|
75 |
+
|
76 |
+
aria2c -x 8 -o deepseek-0628-bf16-00002-of-00011.gguf \
|
77 |
+
https://huggingface.co/nisten/deepseek-0628-gguf/resolve/main/deepseek-0628-bf16-00002-of-00011.gguf
|
78 |
+
|
79 |
+
aria2c -x 8 -o deepseek-0628-bf16-00003-of-00011.gguf \
|
80 |
+
https://huggingface.co/nisten/deepseek-0628-gguf/resolve/main/deepseek-0628-bf16-00003-of-00011.gguf
|
81 |
+
|
82 |
+
aria2c -x 8 -o deepseek-0628-bf16-00004-of-00011.gguf \
|
83 |
+
https://huggingface.co/nisten/deepseek-0628-gguf/resolve/main/deepseek-0628-bf16-00004-of-00011.gguf
|
84 |
+
|
85 |
+
aria2c -x 8 -o deepseek-0628-bf16-00005-of-00011.gguf \
|
86 |
+
https://huggingface.co/nisten/deepseek-0628-gguf/resolve/main/deepseek-0628-bf16-00005-of-00011.gguf
|
87 |
+
|
88 |
+
aria2c -x 8 -o deepseek-0628-bf16-00006-of-00011.gguf \
|
89 |
+
https://huggingface.co/nisten/deepseek-0628-gguf/resolve/main/deepseek-0628-bf16-00006-of-00011.gguf
|
90 |
+
|
91 |
+
aria2c -x 8 -o deepseek-0628-bf16-00007-of-00011.gguf \
|
92 |
+
https://huggingface.co/nisten/deepseek-0628-gguf/resolve/main/deepseek-0628-bf16-00007-of-00011.gguf
|
93 |
+
|
94 |
+
aria2c -x 8 -o deepseek-0628-bf16-00008-of-00011.gguf \
|
95 |
+
https://huggingface.co/nisten/deepseek-0628-gguf/resolve/main/deepseek-0628-bf16-00008-of-00011.gguf
|
96 |
+
|
97 |
+
aria2c -x 8 -o deepseek-0628-bf16-00009-of-00011.gguf \
|
98 |
+
https://huggingface.co/nisten/deepseek-0628-gguf/resolve/main/deepseek-0628-bf16-00009-of-00011.gguf
|
99 |
+
|
100 |
+
aria2c -x 8 -o deepseek-0628-bf16-00010-of-00011.gguf \
|
101 |
+
https://huggingface.co/nisten/deepseek-0628-gguf/resolve/main/deepseek-0628-bf16-00010-of-00011.gguf
|
102 |
+
|
103 |
+
aria2c -x 8 -o deepseek-0628-bf16-00011-of-00011.gguf \
|
104 |
+
https://huggingface.co/nisten/deepseek-0628-gguf/resolve/main/deepseek-0628-bf16-00011-of-00011.gguf
|
105 |
+
```
|
106 |
+
|
107 |
+
📜 The use of DeepSeek-V2-Chat-0628 model is subject to the [DeepSeek Model License](https://github.com/deepseek-ai/DeepSeek-V2/blob/main/LICENSE-MODEL). DeepSeek-V2 series supports commercial use. It's a permissive license that only restricts use for military purposes, harming minors, or patent trolling.
|
108 |
+
|
109 |
+
### 🌟 Model Information
|
110 |
+
|
111 |
+
DeepSeek-V2-Chat-0628 is the latest and greatest in the DeepSeek family. This AI powerhouse has climbed the LMSYS Chatbot Arena Leaderboard faster than a rocket on steroids:
|
112 |
+
|
113 |
+
- 🏆 Overall Arena Ranking: #11 global
|
114 |
+
- 💻 Coding Arena Ranking: #3, global
|
115 |
+
- 🧠 Hard Prompts Arena Ranking: #7 global, better than claude opus even in english only hard-prompts
|
116 |
+
|
117 |
+
Want to seek deeper into this model's ocean of awesomeness? Swim over to the [original model card](https://huggingface.co/deepseek-ai/DeepSeek-V2-Chat-0628) and prepare to have your mind blown! 🤯
|
118 |
+
|
119 |
+
Now go forth and accelerate 🚀💡
|
120 |
+
|
121 |
+
-Nisten
|