aws-neuron/optimum-neuron-cache
License: apache-2.0
1fd222b
optimum-neuron-cache/neuronxcc-2.15.128.0+56dc5a86/0_REGISTRY/0.0.25.dev0/inference/llama/princeton-nlp
Commit History
Synchronizing local compiler cache.
68f4e48
verified
dacorvo
HF staff
committed on
Sep 20, 2024
Synchronizing local compiler cache.
49f80aa
verified
dacorvo
HF staff
committed on
Sep 20, 2024