apepkuss79 commited on
Commit
00e829a
1 Parent(s): 0f99a26

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -21
README.md CHANGED
@@ -31,6 +31,10 @@ quantized_by: Second State Inc.
31
 
32
  prompt template: `chatml`
33
 
 
 
 
 
34
  **Context size:**
35
 
36
  chat_ctx_size: `4096`
@@ -42,24 +46,4 @@ chat_ctx_size: `4096`
42
 
43
  - Customize your node: https://docs.gaianet.ai/node-guide/customize
44
 
45
- ## Quantized GGUF Models
46
-
47
- | Name | Quant method | Bits | Size | Use case |
48
- | ---- | ---- | ---- | ---- | ----- |
49
- | [Yi-1.5-34B-Chat-Q2_K.gguf](https://huggingface.co/gaianet/Yi-1.5-34B-Chat-GGUF/blob/main/Yi-1.5-34B-Chat-Q2_K.gguf) | Q2_K | 2 |12.8 GB| smallest, significant quality loss - not recommended for most purposes |
50
- | [Yi-1.5-34B-Chat-Q3_K_L.gguf](https://huggingface.co/gaianet/Yi-1.5-34B-Chat-GGUF/blob/main/Yi-1.5-34B-Chat-Q3_K_L.gguf) | Q3_K_L | 3 | 18.1 GB| small, substantial quality loss |
51
- | [Yi-1.5-34B-Chat-Q3_K_M.gguf](https://huggingface.co/gaianet/Yi-1.5-34B-Chat-GGUF/blob/main/Yi-1.5-34B-Chat-Q3_K_M.gguf) | Q3_K_M | 3 | 16.7 GB| very small, high quality loss |
52
- | [Yi-1.5-34B-Chat-Q3_K_S.gguf](https://huggingface.co/gaianet/Yi-1.5-34B-Chat-GGUF/blob/main/Yi-1.5-34B-Chat-Q3_K_S.gguf) | Q3_K_S | 3 | 15 GB| very small, high quality loss |
53
- | [Yi-1.5-34B-Chat-Q4_0.gguf](https://huggingface.co/gaianet/Yi-1.5-34B-Chat-GGUF/blob/main/Yi-1.5-34B-Chat-Q4_0.gguf) | Q4_0 | 4 | 19.5 GB| legacy; small, very high quality loss - prefer using Q3_K_M |
54
- | [Yi-1.5-34B-Chat-Q4_K_M.gguf](https://huggingface.co/gaianet/Yi-1.5-34B-Chat-GGUF/blob/main/Yi-1.5-34B-Chat-Q4_K_M.gguf) | Q4_K_M | 4 | 20.7 GB| medium, balanced quality - recommended |
55
- | [Yi-1.5-34B-Chat-Q4_K_S.gguf](https://huggingface.co/gaianet/Yi-1.5-34B-Chat-GGUF/blob/main/Yi-1.5-34B-Chat-Q4_K_S.gguf) | Q4_K_S | 4 | 19.6 GB| small, greater quality loss |
56
- | [Yi-1.5-34B-Chat-Q5_0.gguf](https://huggingface.co/gaianet/Yi-1.5-34B-Chat-GGUF/blob/main/Yi-1.5-34B-Chat-Q5_0.gguf) | Q5_0 | 5 | 23.7 GB| legacy; medium, balanced quality - prefer using Q4_K_M |
57
- | [Yi-1.5-34B-Chat-Q5_K_M.gguf](https://huggingface.co/gaianet/Yi-1.5-34B-Chat-GGUF/blob/main/Yi-1.5-34B-Chat-Q5_K_M.gguf) | Q5_K_M | 5 | 23.4 GB| large, very low quality loss - recommended |
58
- | [Yi-1.5-34B-Chat-Q5_K_S.gguf](https://huggingface.co/gaianet/Yi-1.5-34B-Chat-GGUF/blob/main/Yi-1.5-34B-Chat-Q5_K_S.gguf) | Q5_K_S | 5 | 23.7 GB| large, low quality loss - recommended |
59
- | [Yi-1.5-34B-Chat-Q6_K.gguf](https://huggingface.co/gaianet/Yi-1.5-34B-Chat-GGUF/blob/main/Yi-1.5-34B-Chat-Q6_K.gguf) | Q6_K | 6 | 28.3 GB| very large, extremely low quality loss |
60
- | [Yi-1.5-34B-Chat-Q8_0.gguf](https://huggingface.co/gaianet/Yi-1.5-34B-Chat-GGUF/blob/main/Yi-1.5-34B-Chat-Q8_0.gguf) | Q8_0 | 8 | 36.5 GB| very large, extremely low quality loss - not recommended |
61
- | [Yi-1.5-34B-Chat-f16-00001-of-00003.gguf](https://huggingface.co/gaianet/Yi-1.5-34B-Chat-GGUF/blob/main/Yi-1.5-34B-Chat-f16-00001-of-00003.gguf) | f16 | 16 | 32.2 GB| |
62
- | [Yi-1.5-34B-Chat-f16-00002-of-00003.gguf](https://huggingface.co/gaianet/Yi-1.5-34B-Chat-GGUF/blob/main/Yi-1.5-34B-Chat-f16-00002-of-00003.gguf) | f16 | 16 | 32.1 GB| |
63
- | [Yi-1.5-34B-Chat-f16-00003-of-00003.gguf](https://huggingface.co/gaianet/Yi-1.5-34B-Chat-GGUF/blob/main/Yi-1.5-34B-Chat-f16-00003-of-00003.gguf) | f16 | 16 | 4.48 GB| |
64
-
65
- *Quantized with llama.cpp b2824*
 
31
 
32
  prompt template: `chatml`
33
 
34
+ **Reverse prompt**
35
+
36
+ reverse prompt: `<|im_end|>`
37
+
38
  **Context size:**
39
 
40
  chat_ctx_size: `4096`
 
46
 
47
  - Customize your node: https://docs.gaianet.ai/node-guide/customize
48
 
49
+ *Quantized with llama.cpp b3135*