TheBloke/Yarn-Llama-2-70B-32k-GGUF
Tags: Transformers, GGUF, emozilla/yarn-train-tokenized-8k-llama, English, llama
arxiv: 2309.00071
License: apache-2.0
1 contributor
History: 25 commits
Latest commit: TheBloke, "Upload README.md" (f3ed1cf), about 1 year ago
File | Size | Last commit message | Updated
.gitattributes | 2.5 kB | Upload in splits of max 50GB due to HF 50GB limit. (made with llama.cpp commit 0b871f1) | about 1 year ago
LICENSE.txt | 7.02 kB | Add Llama 2 license files | about 1 year ago
Notice | 112 Bytes | Add Llama 2 license files | about 1 year ago
README.md | 20.9 kB | Upload README.md | about 1 year ago
USE_POLICY.md | 4.77 kB | Add Llama 2 license files | about 1 year ago
config.json | 29 Bytes | GGUF model commit (made with llama.cpp commit 0b871f1) | about 1 year ago
yarn-llama-2-70b-32k.Q2_K.gguf | 29.3 GB (LFS) | GGUF model commit (made with llama.cpp commit 0b871f1) | about 1 year ago
yarn-llama-2-70b-32k.Q3_K_L.gguf | 36.1 GB (LFS) | GGUF model commit (made with llama.cpp commit 0b871f1) | about 1 year ago
yarn-llama-2-70b-32k.Q3_K_M.gguf | 33.2 GB (LFS) | GGUF model commit (made with llama.cpp commit 0b871f1) | about 1 year ago
yarn-llama-2-70b-32k.Q3_K_S.gguf | 29.9 GB (LFS) | GGUF model commit (made with llama.cpp commit 0b871f1) | about 1 year ago
yarn-llama-2-70b-32k.Q4_0.gguf | 38.9 GB (LFS) | GGUF model commit (made with llama.cpp commit 0b871f1) | about 1 year ago
yarn-llama-2-70b-32k.Q4_K_M.gguf | 41.4 GB (LFS) | GGUF model commit (made with llama.cpp commit 0b871f1) | about 1 year ago
yarn-llama-2-70b-32k.Q4_K_S.gguf | 39.1 GB (LFS) | GGUF model commit (made with llama.cpp commit 0b871f1) | about 1 year ago
yarn-llama-2-70b-32k.Q5_0.gguf | 47.5 GB (LFS) | GGUF model commit (made with llama.cpp commit 0b871f1) | about 1 year ago
yarn-llama-2-70b-32k.Q5_K_M.gguf | 48.8 GB (LFS) | GGUF model commit (made with llama.cpp commit 0b871f1) | about 1 year ago
yarn-llama-2-70b-32k.Q5_K_S.gguf | 47.5 GB (LFS) | GGUF model commit (made with llama.cpp commit 0b871f1) | about 1 year ago
yarn-llama-2-70b-32k.Q6_K.gguf-split-a | 28.3 GB (LFS) | Upload in splits of max 50GB due to HF 50GB limit. (made with llama.cpp commit 0b871f1) | about 1 year ago
yarn-llama-2-70b-32k.Q6_K.gguf-split-b | 28.3 GB (LFS) | Upload in splits of max 50GB due to HF 50GB limit. (made with llama.cpp commit 0b871f1) | about 1 year ago
yarn-llama-2-70b-32k.Q8_0.gguf-split-a | 36.6 GB (LFS) | Upload in splits of max 50GB due to HF 50GB limit. (made with llama.cpp commit 0b871f1) | about 1 year ago
yarn-llama-2-70b-32k.Q8_0.gguf-split-b | 36.6 GB (LFS) | Upload in splits of max 50GB due to HF 50GB limit. (made with llama.cpp commit 0b871f1) | about 1 year ago
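The Q6_K and Q8_0 files are uploaded as `-split-a` and `-split-b` parts because of the 50GB per-file limit noted in the commit messages, so they have to be rejoined before use. The following is a minimal sketch of one way to do that in Python. It assumes the parts are plain byte-wise chunks of the original file that only need to be concatenated in alphabetical order; the listing itself does not state how the splits were produced. The helper names `find_splits` and `join_splits` are hypothetical and not part of any library.

```python
import shutil
from pathlib import Path


def find_splits(directory, base_name):
    """Return the split parts of base_name in alphabetical order.

    Assumes parts are named '<base_name>-split-a', '<base_name>-split-b', ...
    as in the listing above.
    """
    return sorted(Path(directory).glob(base_name + "-split-*"))


def join_splits(parts, output, chunk_size=16 * 1024 * 1024):
    """Concatenate the parts, in the order given, into a single output file.

    Assumption: the parts are raw byte chunks of the original file.
    Data is streamed in chunks so multi-GB parts are never held in memory.
    """
    output = Path(output)
    with output.open("wb") as dst:
        for part in parts:
            with Path(part).open("rb") as src:
                shutil.copyfileobj(src, dst, chunk_size)
    return output


# Example (paths are illustrative; the parts must already be downloaded):
# parts = find_splits(".", "yarn-llama-2-70b-32k.Q6_K.gguf")
# join_splits(parts, "yarn-llama-2-70b-32k.Q6_K.gguf")
```

Joining needs free disk space roughly equal to the combined size of the parts (about 56.6 GB for Q6_K and 73.2 GB for Q8_0, going by the sizes listed), in addition to the parts themselves. The parts can be deleted once the joined file loads correctly.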