tenyx
/

TenyxChat-7B-v1

Text Generation

tenyx-fine-tuning

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

sarath-shekkizhar commited on Jan 9

Commit

0ff1c54

•

1 Parent(s): 7307326

Update README.md

Files changed (1) hide show

README.md +11 -0

README.md CHANGED Viewed

@@ -96,6 +96,17 @@ MT-Bench is a benchmark made up of 80 high-quality multi-turn questions. These q
 ![hexplot.png](assets/hexplot.png)
 ## LM Evaluation - Open LLM Leaderboard
 We assess models on 7 benchmarks using the [Eleuther AI Language Model Evaluation Harness](https://github.com/EleutherAI/lm-evaluation-harness). This setup is based of that used for [Open LLM Leaderboard.](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)

 ![hexplot.png](assets/hexplot.png)
+### Comparison with additional Open LLM LeaderBoard models
+| Model | First Turn | Second Turn | Average |
+| --- | --- | --- | --- |
+| TenyxChat-7B-v1 | 8.45000 | 7.756250 | 8.103125 |
+| SamirGPT-v1 | 8.05000 | 7.612500 | 7.831250 |
+| FernandoGPT-v1 | 8.08125 | 7.256250 | 7.668750 |
+| Go-Bruins-v2 | 8.13750 | 7.150000 | 7.643750 |
+| mistral_tv-neural-marconroni | 7.76875 | 6.987500 | 7.378125 |
+| neuronovo-7B-v0.2 | 7.73750 | 6.662500 | 7.200000 |
+| neural-chat-7b-v3-3 | 7.39375 | 5.881250 | 6.637500 |
 ## LM Evaluation - Open LLM Leaderboard
 We assess models on 7 benchmarks using the [Eleuther AI Language Model Evaluation Harness](https://github.com/EleutherAI/lm-evaluation-harness). This setup is based of that used for [Open LLM Leaderboard.](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)