Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
aws-neuron
/
optimum-neuron-cache
like
13
Follow
AWS Inferentia and Trainium
76
License:
apache-2.0
Model card
Files
Files and versions
Community
299
1d9c9d0
optimum-neuron-cache
/
neuronxcc-2.13.66.0+6dfecc895
/
0_REGISTRY
/
0.0.21
/
inference
/
llama
/
meta-llama
/
Llama-2-70b-chat-hf
8 contributors
History:
2 commits
dacorvo
HF staff
Synchronizing local compiler cache.
e5d5bbe
verified
10 months ago
1c0ffc384d07e27fbe8a.json
861 Bytes
Synchronizing local compiler cache.
10 months ago
8d49601d4e2484beb8d2.json
861 Bytes
Synchronizing local compiler cache.
10 months ago