Using the fastest-inference-4bit branch