torch
accelerate
bitsandbytes
transformers
elasticsearch
openai
vllm