Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
RichardErkhov
/
nvidia_-_Llama-3.1-Nemotron-70B-Instruct-HF-gguf
like
0
GGUF
Inference Endpoints
conversational
Model card
Files
Files and versions
Community
Deploy
Use this model
ca65c9a
nvidia_-_Llama-3.1-Nemotron-70B-Instruct-HF-gguf
1 contributor
History:
19 commits
RichardErkhov
uploaded model
ca65c9a
verified
about 1 month ago
IQ4_NL
uploaded model
about 1 month ago
Q4_1
uploaded model
about 1 month ago
Q4_K
uploaded model
about 1 month ago
Q4_K_M
uploaded model
about 1 month ago
Q4_K_S
uploaded model
about 1 month ago
Q5_0
uploaded model
about 1 month ago
Q5_K
uploaded model
about 1 month ago
Q5_K_S
uploaded model
about 1 month ago
.gitattributes
3.99 kB
uploaded model
about 1 month ago
Llama-3.1-Nemotron-70B-Instruct-HF.IQ3_M.gguf
31.9 GB
LFS
uploaded model
about 1 month ago
Llama-3.1-Nemotron-70B-Instruct-HF.IQ3_S.gguf
30.9 GB
LFS
uploaded model
about 1 month ago
Llama-3.1-Nemotron-70B-Instruct-HF.IQ3_XS.gguf
29.3 GB
LFS
uploaded model
about 1 month ago
Llama-3.1-Nemotron-70B-Instruct-HF.IQ4_XS.gguf
38.3 GB
LFS
uploaded model
about 1 month ago
Llama-3.1-Nemotron-70B-Instruct-HF.Q2_K.gguf
26.4 GB
LFS
uploaded model
about 1 month ago
Llama-3.1-Nemotron-70B-Instruct-HF.Q3_K.gguf
34.3 GB
LFS
uploaded model
about 1 month ago
Llama-3.1-Nemotron-70B-Instruct-HF.Q3_K_L.gguf
37.1 GB
LFS
uploaded model
about 1 month ago
Llama-3.1-Nemotron-70B-Instruct-HF.Q3_K_M.gguf
34.3 GB
LFS
uploaded model
about 1 month ago
Llama-3.1-Nemotron-70B-Instruct-HF.Q3_K_S.gguf
30.9 GB
LFS
uploaded model
about 1 month ago
Llama-3.1-Nemotron-70B-Instruct-HF.Q4_0.gguf
40 GB
LFS
uploaded model
about 1 month ago