Add more batch_size for mistral on smaller instances 545cd4d verified dacorvo HF staff committed on May 31, 2024
Use princeton-nlp/Sheared-LLaMA-1.3B as a test model 695b341 verified dacorvo HF staff committed on May 30, 2024