gemma-2-2b-it-GPTQ / README.md
shuyuej's picture
Update README.md
75ab37c verified
|
raw
history blame
682 Bytes
---
license: apache-2.0
---
# The Quantized Google Gemma 2 2B Instruction Tuned Model
Original Base Model: `google/gemma-2-2b-it`.<br>
Link: [https://huggingface.co/google/gemma-2-2b-it](https://huggingface.co/google/gemma-2-2b-it)
## Quantization Configurations
```
"quantization_config": {
"batch_size": 1,
"bits": 4,
"block_name_to_quantize": null,
"cache_block_outputs": true,
"damp_percent": 0.1,
"dataset": null,
"desc_act": false,
"exllama_config": {
"version": 1
},
```
## Source Codes
Source Codes: [https://github.com/vkola-lab/medpodgpt/tree/main/quantization](https://github.com/vkola-lab/medpodgpt/tree/main/quantization).