Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
aws-neuron
/
optimum-neuron-cache
like
14
Follow
AWS Inferentia and Trainium
77
License:
apache-2.0
Model card
Files
Files and versions
Community
304
aa77fe4
optimum-neuron-cache
/
neuronxcc-2.13.66.0+6dfecc895
/
0_REGISTRY
/
0.0.21
/
inference
/
llama
/
meta-llama
/
Llama-2-70b-chat-hf
8 contributors
History:
2 commits
dacorvo
HF staff
Synchronizing local compiler cache.
e5d5bbe
verified
10 months ago
1c0ffc384d07e27fbe8a.json
861 Bytes
Synchronizing local compiler cache.
10 months ago
8d49601d4e2484beb8d2.json
861 Bytes
Synchronizing local compiler cache.
10 months ago