leonardlin committed
Commit 7e34145
1 Parent(s): 57f36e6

Update README.md

Files changed (1):
  1. README.md +16 -0
README.md CHANGED
@@ -4,6 +4,22 @@
 
 See https://huggingface.co/shisa-ai/shisa-v1-llama3-70b for the working model
 
+First turn seems to work well (e.g., it benchmarks fine), but after about turn three the model starts to output random tokens...
+
+## Performance
+Measured using a [fork](https://github.com/shisa-ai/shaberi) of [Lightblue's Shaberi benchmark framework](https://github.com/lightblue-tech/japanese_llm_eval):
+
+| Model                                   | Average  | ELYZA-tasks-100 | MT-Bench | Rakuda   | Tengu-Bench |
+|-----------------------------------------|----------|-----------------|----------|----------|-------------|
+| **shisa-ai/shisa-v1-llama3-70b**        | **7.30** | **7.34**        | **7.67** | **8.15** | **6.04**    |
+| **shisa-ai/shisa-v1-llama3-70b.Q4_K_M** | **7.22** | **7.22**        | **7.27** | **8.20** | **6.19**    |
+
+---
+
 Quick and dirty GGUF quants. Maybe some imatrix quants soon. BF16 conversion included in this repo.
 
 split:
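
For quick local testing of the split quants described above, here is a minimal sketch using [llama-cpp-python](https://github.com/abetlen/llama-cpp-python). The shard filename, shard count, and context size below are assumptions for illustration, not values taken from this repo; point `model_path` at the actual first split file once all shards are in one directory.

```python
# Minimal sketch, assuming all split GGUF shards sit in the working directory
# and llama-cpp-python is installed (pip install llama-cpp-python).
from llama_cpp import Llama

llm = Llama(
    # Hypothetical shard name -- substitute the actual first split from this repo.
    # llama.cpp detects and loads the remaining -0000N-of-0000M shards itself.
    model_path="shisa-v1-llama3-70b.Q4_K_M-00001-of-00002.gguf",
    n_gpu_layers=-1,   # offload all layers to GPU if VRAM allows
    n_ctx=8192,        # assumed context size
)

# Single-turn chat completion; per the note in the diff above, multi-turn
# output may degrade into random tokens after a few turns.
messages = [{"role": "user", "content": "日本の首都はどこですか？"}]
out = llm.create_chat_completion(messages=messages, max_tokens=256)
print(out["choices"][0]["message"]["content"])
```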