metadata
license: apache-2.0
The Quantized Google Gemma 2 2B Instruction-Tuned Model
Original Base Model: google/gemma-2-2b-it.
Link: https://huggingface.co/google/gemma-2-2b-it
Quantization Configurations
"quantization_config": {
  "batch_size": 1,
  "bits": 4,
  "block_name_to_quantize": null,
  "cache_block_outputs": true,
  "damp_percent": 0.1,
  "dataset": null,
  "desc_act": false,
  "exllama_config": {
    "version": 1
  },
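The listing above is cut off after "exllama_config", so the full configuration may contain further keys. The visible field names match arguments of `GPTQConfig` in Hugging Face Transformers, which suggests (but does not confirm) that GPTQ was used. As a minimal sketch covering only the fields shown, the snippet below parses them and prints a short summary. The helper `summarize_quantization` and the summary keys are hypothetical names introduced for illustration, and the "nominal ratio" is a rough figure that ignores scales, zero points, and any layers left unquantized.

```python
import json

# Only the fields visible in this card; the original listing is truncated,
# so the real quantization_config may contain additional keys.
VISIBLE_FIELDS = """
{
  "batch_size": 1,
  "bits": 4,
  "block_name_to_quantize": null,
  "cache_block_outputs": true,
  "damp_percent": 0.1,
  "dataset": null,
  "desc_act": false,
  "exllama_config": {"version": 1}
}
"""


def summarize_quantization(cfg: dict) -> dict:
    """Hypothetical helper: condense the visible settings into a small summary."""
    bits = cfg["bits"]
    return {
        "bits_per_weight": bits,
        # Nominal size reduction relative to 16-bit weights (an approximation:
        # it ignores scales, zero points, and any unquantized layers).
        "nominal_ratio_vs_fp16": 16 / bits,
        # desc_act toggles activation-order ("act-order") quantization.
        "act_order": cfg["desc_act"],
        "exllama_version": cfg["exllama_config"]["version"],
    }


quant_cfg = json.loads(VISIBLE_FIELDS)
summary = summarize_quantization(quant_cfg)
print(summary)
```

To inspect the actual, complete configuration, the same function could be applied to the "quantization_config" entry of the model's own config.json, assuming that file is available locally.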
Source Code
Source code: https://github.com/vkola-lab/medpodgpt/tree/main/quantization