nisten committed on
Commit
15c8764
1 Parent(s): b9c9d0b

Update README.md

  1. README.md +3 -3
README.md CHANGED
@@ -9,11 +9,11 @@ the imatrix.dat file was calcuated over 1000 chunks with wikitext.train.raw( inc
 
  Wrote a bit of custom c++ to avoid quantizing certain layers, tested fully compatible with llama.cpp as of 10April2024.
 
- To put the 8bit file back together do
+ To put it all asa single file ( this is not needed with llama.cpp as it will autodetect the chunks but can help troubleshooting ollama)
 
  ```
- cat ~mix4ns.gguf.part* > ~mix4ns.gguf && rm -rf mix4ns.gguf.part*
- cat ~mix8ns.gguf.part* > ~mix8ns.gguf && rm -rf mix4ns.gguf.part*
+ cat mix4ns-0000* > mix4ns.gguf
+
  ```
  careful this can take 5 minutes or up to 10-15 on slow instances, check progress with ls -la
 
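
The join step introduced by this change can be wrapped in a small shell function that also checks the result. This is a minimal sketch, not part of the commit: it assumes the chunks are plain byte-wise splits that sort correctly by name, and the `join_parts` helper and its size check are hypothetical additions for illustration.

```shell
#!/bin/sh
# Hypothetical helper (not from the commit): concatenate split parts into
# one file, then confirm the output size equals the sum of the part sizes.
join_parts() {
    out="$1"
    shift
    # Glob arguments arrive in sorted order, so parts are joined in sequence.
    cat "$@" > "$out" || return 1
    total=0
    for part in "$@"; do
        # Arithmetic expansion strips any padding that wc may emit.
        total=$((total + $(wc -c < "$part")))
    done
    out_size=$(( $(wc -c < "$out") ))
    # Succeeds only if no bytes were lost or duplicated.
    [ "$out_size" -eq "$total" ]
}
```

With the file names from the diff, it could be called as `join_parts mix4ns.gguf mix4ns-0000*`; a non-zero exit status means the joined file does not match the combined size of the parts.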