llamafile
English
GGUF

Commit History

Add TinyLlama-1.1B-Chat-v1.0.Q3_K_L.gguf to repo using llamafile a6d041a3b59582d2a43c5837cf170cccaa511180
6f661f1

jartine commited on

Add TinyLlama-1.1B-Chat-v1.0.Q2_K.gguf to repo using llamafile a6d041a3b59582d2a43c5837cf170cccaa511180
19dadc2

jartine commited on

Add TinyLlama-1.1B-Chat-v1.0.Q2_K.llamafile to repo
a86f873

jartine commited on

Add TinyLlama-1.1B-Chat-v1.0.Q2_K.gguf to repo using llamafile a6d041a3b59582d2a43c5837cf170cccaa511180
24db2be

jartine commited on

Add README.md to repo
5efcc75

jartine commited on

Add .gitattributes to repo
bfecdb2

jartine commited on

Upload TinyLlama-1.1B-Chat-v1.0.f16.gguf with huggingface_hub
271bc1e

jartine commited on

Add TinyLlama-1.1B-Chat-v1.0.f16.gguf to repo using llamafile 1d9fa85f0c136d81c6684484c05582e3f4801b21
3cb317f

jartine commited on

Add TinyLlama-1.1B-Chat-v1.0.Q8_0.gguf to repo using llamafile 1d9fa85f0c136d81c6684484c05582e3f4801b21
e7b8a51

jartine commited on

Add TinyLlama-1.1B-Chat-v1.0.Q6_K.gguf to repo using llamafile 1d9fa85f0c136d81c6684484c05582e3f4801b21
d8f2ce6

jartine commited on

Add TinyLlama-1.1B-Chat-v1.0.Q5_K_S.gguf to repo using llamafile 1d9fa85f0c136d81c6684484c05582e3f4801b21
0b0ec4f

jartine commited on

Add TinyLlama-1.1B-Chat-v1.0.Q5_K_M.gguf to repo using llamafile 1d9fa85f0c136d81c6684484c05582e3f4801b21
4264cca

jartine commited on

Add TinyLlama-1.1B-Chat-v1.0.Q4_K_S.gguf to repo using llamafile 1d9fa85f0c136d81c6684484c05582e3f4801b21
a3803f0

jartine commited on

Add TinyLlama-1.1B-Chat-v1.0.Q4_K_M.gguf to repo using llamafile 1d9fa85f0c136d81c6684484c05582e3f4801b21
88ac6ed

jartine commited on

Add TinyLlama-1.1B-Chat-v1.0.Q4_0.gguf to repo using llamafile 1d9fa85f0c136d81c6684484c05582e3f4801b21
c2ab9a9

jartine commited on

Add TinyLlama-1.1B-Chat-v1.0.Q3_K_S.gguf to repo using llamafile 1d9fa85f0c136d81c6684484c05582e3f4801b21
19ec640

jartine commited on

Add TinyLlama-1.1B-Chat-v1.0.Q3_K_M.gguf to repo using llamafile 1d9fa85f0c136d81c6684484c05582e3f4801b21
ae088eb

jartine commited on

Add TinyLlama-1.1B-Chat-v1.0.Q3_K_L.gguf to repo using llamafile 1d9fa85f0c136d81c6684484c05582e3f4801b21
de943b0

jartine commited on

Add TinyLlama-1.1B-Chat-v1.0.Q2_K.gguf to repo using llamafile 1d9fa85f0c136d81c6684484c05582e3f4801b21
4324a07

jartine commited on

Add README for quantized weights
da72cd3

jartine commited on

Add README for quantized weights
1cc7774

jartine commited on

Convert TinyLlama/TinyLlama-1.1B-Chat-v1.0 to GGUF weights using llamafile-quantize 1d9fa85f0c136d81c6684484c05582e3f4801b21
18e69e7

jartine commited on

Convert TinyLlama/TinyLlama-1.1B-Chat-v1.0 to GGUF weights using llamafile-quantize 1d9fa85f0c136d81c6684484c05582e3f4801b21
25ae9a9

jartine commited on

Convert TinyLlama/TinyLlama-1.1B-Chat-v1.0 to GGUF weights using llamafile-quantize 1d9fa85f0c136d81c6684484c05582e3f4801b21
1757add

jartine commited on

Convert TinyLlama/TinyLlama-1.1B-Chat-v1.0 to GGUF weights using llamafile-quantize 1d9fa85f0c136d81c6684484c05582e3f4801b21
a09f5ff

jartine commited on

Convert TinyLlama/TinyLlama-1.1B-Chat-v1.0 to GGUF weights using llamafile-quantize 1d9fa85f0c136d81c6684484c05582e3f4801b21
623bf7e

jartine commited on

Convert TinyLlama/TinyLlama-1.1B-Chat-v1.0 to GGUF weights using llamafile-quantize 1d9fa85f0c136d81c6684484c05582e3f4801b21
734d0b9

jartine commited on

Convert TinyLlama/TinyLlama-1.1B-Chat-v1.0 to GGUF weights using llamafile-quantize 1d9fa85f0c136d81c6684484c05582e3f4801b21
1ca064a

jartine commited on

Convert TinyLlama/TinyLlama-1.1B-Chat-v1.0 to GGUF weights using llamafile-quantize 1d9fa85f0c136d81c6684484c05582e3f4801b21
4bf145d

jartine commited on

Convert TinyLlama/TinyLlama-1.1B-Chat-v1.0 to GGUF weights using llamafile-quantize 1d9fa85f0c136d81c6684484c05582e3f4801b21
32b626c

jartine commited on

Convert TinyLlama/TinyLlama-1.1B-Chat-v1.0 to GGUF weights using llamafile-quantize 1d9fa85f0c136d81c6684484c05582e3f4801b21
bc8f9a6

jartine commited on

Convert TinyLlama/TinyLlama-1.1B-Chat-v1.0 to GGUF weights using llamafile-quantize 1d9fa85f0c136d81c6684484c05582e3f4801b21
4df03a3

jartine commited on

Convert TinyLlama/TinyLlama-1.1B-Chat-v1.0 to GGUF weights using llamafile-quantize 1d9fa85f0c136d81c6684484c05582e3f4801b21
bb45d9d

jartine commited on

initial commit
1abeb5b

jartine commited on