neuralmagic/Meta-Llama-3.1-8B-Instruct-quantized.w4a16
Tags: Text Generation, Transformers, Safetensors, 8 languages, llama, int4, vllm, conversational, text-generation-inference, Inference Endpoints, 4-bit precision, gptq
License: llama3.1
Commit History
72eb322  Update README.md (verified), alexmarques committed on Oct 10
153436a  Update README.md (verified), alexmarques committed on Oct 10
764531e  Update README.md (verified), alexmarques committed on Sep 30
5b554e0  Update README.md (verified), alexmarques committed on Sep 30
1455f0f  Upload tokenizer.json with huggingface_hub (verified), alexmarques committed on Sep 30
8f210e8  Update README.md (verified), alexmarques committed on Sep 30
ee2cc51  Update README.md (verified), alexmarques committed on Sep 30
2886071  Upload tokenizer_config.json with huggingface_hub (verified), alexmarques committed on Sep 27
45a8720  Update README.md (verified), alexmarques committed on Sep 10
8ecfb5a  Update README.md (verified), alexmarques committed on Aug 13
46cdbb6  Upload folder using huggingface_hub (verified), alexmarques committed on Aug 13
c8857c0  Update README.md (verified), alexmarques committed on Aug 13
0dbc179  Update README.md (verified), abhinavnmagic committed on Aug 8
34fbd4c  Update README.md (verified), alexmarques committed on Jul 30
14bf365  Create README.md (verified), abhinavnmagic committed on Jul 26
822ee82  Upload folder using huggingface_hub (verified), abhinavnmagic committed on Jul 26
e838ba6  initial commit (verified), abhinavnmagic committed on Jul 26