gradio
transformers
requests
accelerate
tiktoken
einops
transformers_stream_generator==0.0.4
scipy
torchvision
tensorboard
matplotlib
bitsandbytes
optimum
auto-gptq
mdtex2html
packaging
ninja