modelscope transformers torch einops accelerate tiktoken flash-attention transformers_stream_generator==0.0.4 peft deepspeed bitsandbytes safetensors sentencepiece scipy torch==2.0.1 torchaudio==2.0.2 torchvision==0.15.2 diffusers optimum auto-gptq