Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
legraphista
/
Llama-3.1-Minitron-4B-Width-Base-GGUF
like
13
Text Generation
GGUF
quantized
GGUF
quantization
static
16bit
8bit
6bit
5bit
4bit
3bit
2bit
License:
nvidia-open-model-license
Model card
Files
Files and versions
Community
Use this model
1981fc8
Llama-3.1-Minitron-4B-Width-Base-GGUF
1 contributor
History:
14 commits
legraphista
Upload Llama-3.1-Minitron-4B-Width-Base.Q5_K_S.gguf with huggingface_hub
1981fc8
verified
3 months ago
.gitattributes
2.07 kB
Upload Llama-3.1-Minitron-4B-Width-Base.Q5_K_S.gguf with huggingface_hub
3 months ago
Llama-3.1-Minitron-4B-Width-Base.Q2_K.gguf
1.84 GB
LFS
Upload Llama-3.1-Minitron-4B-Width-Base.Q2_K.gguf with huggingface_hub
3 months ago
Llama-3.1-Minitron-4B-Width-Base.Q3_K.gguf
2.3 GB
LFS
Upload Llama-3.1-Minitron-4B-Width-Base.Q3_K.gguf with huggingface_hub
3 months ago
Llama-3.1-Minitron-4B-Width-Base.Q4_K.gguf
2.78 GB
LFS
Upload Llama-3.1-Minitron-4B-Width-Base.Q4_K.gguf with huggingface_hub
3 months ago
Llama-3.1-Minitron-4B-Width-Base.Q5_K.gguf
3.23 GB
LFS
Upload Llama-3.1-Minitron-4B-Width-Base.Q5_K.gguf with huggingface_hub
3 months ago
Llama-3.1-Minitron-4B-Width-Base.Q5_K_S.gguf
3.16 GB
LFS
Upload Llama-3.1-Minitron-4B-Width-Base.Q5_K_S.gguf with huggingface_hub
3 months ago
Llama-3.1-Minitron-4B-Width-Base.Q6_K.gguf
3.71 GB
LFS
Upload Llama-3.1-Minitron-4B-Width-Base.Q6_K.gguf with huggingface_hub
3 months ago
Llama-3.1-Minitron-4B-Width-Base.Q8_0.gguf
4.8 GB
LFS
Upload Llama-3.1-Minitron-4B-Width-Base.Q8_0.gguf with huggingface_hub
3 months ago
README.md
6.97 kB
Upload README.md with huggingface_hub
3 months ago