Update README.md
README.md
CHANGED
@@ -14,6 +14,14 @@ pipeline_tag: text2text-generation
Merged and GPTQ quantized version of [rayliuca/TRagx-internlm2-7b](https://huggingface.co/rayliuca/TRagx-internlm2-7b)

+ Note: I'm having some difficulty quantizing the models using GPTQ. The Mistral and NeuralOmniBeagle GPTQ models have significantly degraded output, and the quantized TowerInstruct v0.2 was not working correctly.
+
+ While this quantized InternLM2 model seems to work all right, its translation accuracy has not been validated.
+
+ These AWQ quantized models are recommended:
+ - [rayliuca/TRagx-AWQ-NeuralOmniBeagle-7B](https://huggingface.co/rayliuca/TRagx-AWQ-NeuralOmniBeagle-7B)
+ - [rayliuca/TRagx-AWQ-Mistral-7B-Instruct-v0.2](https://huggingface.co/rayliuca/TRagx-AWQ-Mistral-7B-Instruct-v0.2)
+
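
As a usage note, here is a minimal loading sketch, assuming the standard `transformers` path for GPTQ checkpoints (with `auto-gptq` and `optimum` installed). The repo id and prompt below are illustrative placeholders, not taken from this card, and TRagx models may expect their own prompt format:

```python
# Rough loading sketch, not from the model card: assumes the standard
# transformers path for GPTQ checkpoints, with auto-gptq and optimum installed.
# The repo id below is a placeholder for this card's repo, and the prompt is
# illustrative; TRagx models may expect their own prompt/chat format.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "rayliuca/TRagx-GPTQ-internlm2-7b"  # placeholder repo id

tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    device_map="auto",       # place the quantized weights on available GPUs
    trust_remote_code=True,  # InternLM2 ships custom modeling code
)

prompt = "Translate to English: 今日はいい天気ですね。"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

The recommended AWQ checkpoints should load through the same `from_pretrained` path, given `autoawq` is installed.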

## GPTQ Dataset