https://arxiv.org/pdf/2403.08715
The following bitsandbytes quantization config was used during training:
bitsandbytes
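
For readers unfamiliar with the format, a bitsandbytes quantization config is typically built with the `BitsAndBytesConfig` class from Hugging Face `transformers`. The sketch below is illustrative only: the values are placeholder assumptions, not the settings used to train this model, which are not listed here. The helper names `example_bnb_kwargs` and `build_config` are hypothetical.

```python
# Illustrative sketch only: the values below are placeholder assumptions,
# NOT the configuration actually used to train this model.

def example_bnb_kwargs():
    """Return example keyword arguments for transformers.BitsAndBytesConfig."""
    return {
        "load_in_4bit": True,               # assumed: 4-bit weight loading
        "bnb_4bit_quant_type": "nf4",       # assumed: NormalFloat4 quantization
        "bnb_4bit_use_double_quant": True,  # assumed: also quantize the quantization constants
    }


def build_config():
    """Build a BitsAndBytesConfig, or return None if the optional packages are missing."""
    try:
        import torch
        from transformers import BitsAndBytesConfig

        # The compute dtype is also an assumption made for this example.
        return BitsAndBytesConfig(
            bnb_4bit_compute_dtype=torch.bfloat16,
            **example_bnb_kwargs(),
        )
    except ImportError:
        return None
```

The resulting object would normally be passed as `quantization_config=` to `from_pretrained` when loading the base model; replace the placeholder values with the ones actually recorded for this model.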