[Cache Request] aaditya/Llama3-OpenBioLLM-8B

#106
by sagarjethi - opened

Please add the following model to the neuron cache

AWS Inferentia and Trainium org

I think that if you just edit the config.json to set `use_cache` to `true`, then the config is identical to the meta-llama/Meta-Llama-3-8B config and the model will be detected as cached.
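For reference, a minimal sketch of that edit using transformers' `AutoConfig` (the local output path is just an example; adjust it to wherever you keep the model):

```python
from transformers import AutoConfig

# Load the model's current configuration from the Hub
config = AutoConfig.from_pretrained("aaditya/Llama3-OpenBioLLM-8B")

# Enable KV caching so the config matches meta-llama/Meta-Llama-3-8B
config.use_cache = True

# Write the updated config.json to a local directory (example path)
config.save_pretrained("./Llama3-OpenBioLLM-8B")
```

With the updated config saved alongside the weights, the cache lookup should match the existing Meta-Llama-3-8B entry.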
