Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
neuralmagic
/
Meta-Llama-3.1-8B-Instruct-quantized.w8a8
like
14
Follow
Neural Magic
302
Text Generation
Transformers
Safetensors
8 languages
llama
int8
vllm
conversational
text-generation-inference
Inference Endpoints
8-bit precision
compressed-tensors
arxiv:
2210.17323
License:
llama3.1
Model card
Files
Files and versions
Community
2
Train
Deploy
Use this model
2bfe93f
Meta-Llama-3.1-8B-Instruct-quantized.w8a8
Commit History
Update README.md
2bfe93f
verified
alexmarques
commited on
Aug 13, 2024
Update README.md
44910a5
verified
alexmarques
commited on
Aug 7, 2024
Update README.md
e94a1df
verified
alexmarques
commited on
Aug 7, 2024
Update README.md
41fa77c
verified
alexmarques
commited on
Aug 7, 2024
Update README.md
e191b8d
verified
alexmarques
commited on
Jul 30, 2024
Update README.md
8f89d5f
verified
alexmarques
commited on
Jul 26, 2024
Update README.md
a5278f0
verified
alexmarques
commited on
Jul 24, 2024
Update README.md
be01205
verified
alexmarques
commited on
Jul 24, 2024
Update README.md
8e4a37a
verified
alexmarques
commited on
Jul 24, 2024
Create README.md
fa37030
verified
alexmarques
commited on
Jul 24, 2024
Upload folder using huggingface_hub
25b9a14
verified
alexmarques
commited on
Jul 24, 2024
initial commit
8599c95
verified
alexmarques
commited on
Jul 24, 2024