aws-neuron/optimum-neuron-cache
AWS Inferentia and Trainium
License: apache-2.0
c9f4999
optimum-neuron-cache/inference-cache-config
Commit History
Add Zephyr to mistral variants
9164704
verified
dacorvo
HF staff
committed on
Mar 21
Remove variants from main mistral config
ef07aca
verified
dacorvo
HF staff
committed on
Mar 21
Add mistral most popular variants
d3983e8
verified
dacorvo
HF staff
committed on
Mar 21
Add most popular llama variants
594abb2
verified
dacorvo
HF staff
committed on
Mar 21
Added teknium/OpenHermes-2.5-Mistral-7B
1518247
verified
dacorvo
HF staff
committed on
Mar 8
Added Llama-70b batch_size 4 to inference cache
593822e
verified
dacorvo
HF staff
committed on
Mar 8
Create mistral.json
b5d0afd
verified
philschmid
HF staff
committed on
Mar 5
Create gpt2.json
3bdb891
verified
philschmid
HF staff
committed on
Mar 5
Create inference-cache-config/llama.json
1960ccb
verified
philschmid
HF staff
committed on
Mar 5