transformers gradio torch bitsandbytes accelerate autoawq huggingface_hub llama-cpp-python