kaiokendev committed d96160d (parent: 61e5599)

Update README.md

README.md CHANGED
@@ -31,6 +31,7 @@ It uses a mixture of the following datasets:
 - 30B 4-bit 128g CUDA: [https://huggingface.co/tsumeone/llama-30b-supercot-4bit-128g-cuda](https://huggingface.co/tsumeone/llama-30b-supercot-4bit-128g-cuda)
 - 30B 4-bit 128g TRITON: N/A
 - 30B 4-bit CUDA (no groupsize, better VRAM): [https://huggingface.co/tsumeone/llama-30b-supercot-4bit-cuda](https://huggingface.co/tsumeone/llama-30b-supercot-4bit-cuda)
+- 30B 3-bit 128g CUDA: [https://huggingface.co/tsumeone/llama-30b-supercot-3bit-128g-cuda](https://huggingface.co/tsumeone/llama-30b-supercot-3bit-128g-cuda)
 
 #### 13B
 - GGML 13B 4-bit: [https://huggingface.co/gozfarb/llama-13b-supercot-ggml](https://huggingface.co/gozfarb/llama-13b-supercot-ggml)