---
license: apache-2.0
inference: false
tags:
- auto-gptq
pipeline_tag: text-generation
---

# redpajama gptq: RedPajama-INCITE-Chat-3B-v1

<a href="https://colab.research.google.com/gist/pszemraj/86d2e8485df182302646ed2c5a637059/inference-with-redpajama-incite-chat-3b-v1-gptq-4bit-128g.ipynb">
  <img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/>
</a>

A GPTQ quantization of the [RedPajama-INCITE-Chat-3B-v1](https://huggingface.co/togethercomputer/RedPajama-INCITE-Chat-3B-v1) model via auto-gptq. The quantized model file is only 2 GB.

## Usage

> Note that you cannot yet load directly from the hub with `auto_gptq` - if needed, you can use [this function](https://gist.github.com/pszemraj/8368cba3400bda6879e521a55d2346d0) to download the files using the repo name.

First, install auto-GPTQ:

```bash
pip install ninja auto-gptq[triton]
```