cstr
/

llama3.1-8b-spaetzle-v51-GGUF-1

Model card Files Files and versions Community

cstr commited on Jul 26

Commit

4a24bcb

•

1 Parent(s): a8ddb0f

Update README.md

Files changed (1) hide show

README.md +1 -4

README.md CHANGED Viewed

@@ -14,10 +14,7 @@ language:
 This is only a quick test in merging 3 and 3.1 llamas despite a number of differences in tokenizer setup i.a., also motivated by ongoing problems with BOS, looping, etc, with 3.1, esp. with llama.cpp, missing full RoPE scaling yet, etc. Performance is yet not satisfactory of course, which might have a number of causes.
-GGUF is (for another test purpose) done with old llama.cpp binary (b2750) and
-``` code
---leave-output-tensor --token-embedding-type f16.
-```
 ### Summary Table

 This is only a quick test in merging 3 and 3.1 llamas despite a number of differences in tokenizer setup i.a., also motivated by ongoing problems with BOS, looping, etc, with 3.1, esp. with llama.cpp, missing full RoPE scaling yet, etc. Performance is yet not satisfactory of course, which might have a number of causes.
+GGUF is (for another test purpose) done with old llama.cpp binary (b2750).
 ### Summary Table