Update inference-cache-config/llama-variants.json e7179a3 verified dacorvo HF staff commited on Jun 27
Use princeton-nlp/Sheared-LLaMA-1.3B as a test model 695b341 verified dacorvo HF staff commited on May 30