Update inference-cache-config/llama-variants.json e7179a3 verified dacorvo HF staff committed on Jun 27
Rename inference-cache-config/llama2.json to inference-cache-config/llama2-7b-13b.json be28bda verified dacorvo HF staff committed on Jun 27
Rename inference-cache-config/llama3.json to inference-cache-config/llama3-8b.json 06bc70d verified dacorvo HF staff committed on Jun 27
Add more batch_size values for mistral on smaller instances 545cd4d verified dacorvo HF staff committed on May 31
Use princeton-nlp/Sheared-LLaMA-1.3B as a test model 695b341 verified dacorvo HF staff committed on May 30
Rename inference-cache-config/llama.json to inference-cache-config/llama2.json f06a55a verified dacorvo HF staff committed on Apr 19
Create stable-diffusion.json (#43) 32561fe verified philschmid HF staff, Jingya HF staff committed on Apr 4