Update README.md
README.md
CHANGED
@@ -64,4 +64,18 @@ pipeline = transformers.pipeline(
outputs = pipeline(prompt, max_new_tokens=256, do_sample=True, temperature=0.7, top_k=50, top_p=0.95)
print(outputs[0]["generated_text"])
```
## Evaluations

Evaluations were run using mlabonne's useful [Colab notebook llm-autoeval](https://github.com/mlabonne/llm-autoeval).
Also check out the alternative leaderboard at [Yet_Another_LLM_Leaderboard](https://huggingface.co/spaces/mlabonne/Yet_Another_LLM_Leaderboard).

[phizzle](https://huggingface.co/Isotonic/phizzle) - Yet to be benchmarked

| Model | AGIEval | GPT4All | TruthfulQA | Bigbench | Average |
|-------------------------------------------------------------------------------------|----------:|----------:|-----------:|---------:|----------:|
| [phi-2-orange](https://huggingface.co/rhysjones/phi-2-orange) | **33.37** | 71.33 | 49.87 | **37.3** | **47.97** |
| [phi-2-dpo](https://huggingface.co/lxuechen/phi-2-dpo) | 30.39 | **71.68** | **50.75** | 34.9 | 46.93 |
| [dolphin-2_6-phi-2](https://huggingface.co/cognitivecomputations/dolphin-2_6-phi-2) | 33.12 | 69.85 | 47.39 | 37.2 | 46.89 |
| [phi-2](https://huggingface.co/microsoft/phi-2) | 27.98 | 70.8 | 44.43 | 35.21 | 44.61 |
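
The README does not say how the Average column is computed. The reported values are consistent with an unweighted mean of the four benchmark scores, and the sketch below checks that under this assumption. The names `scores`, `mean_score`, and `check_averages` are hypothetical helpers for illustration and are not part of llm-autoeval.

```python
# Scores copied from the table above, in the order:
# AGIEval, GPT4All, TruthfulQA, Bigbench, followed by the reported Average.
scores = {
    "phi-2-orange": ([33.37, 71.33, 49.87, 37.3], 47.97),
    "phi-2-dpo": ([30.39, 71.68, 50.75, 34.9], 46.93),
    "dolphin-2_6-phi-2": ([33.12, 69.85, 47.39, 37.2], 46.89),
    "phi-2": ([27.98, 70.8, 44.43, 35.21], 44.61),
}


def mean_score(benchmarks):
    """Unweighted mean of the per-benchmark scores (assumed definition of Average)."""
    return sum(benchmarks) / len(benchmarks)


def check_averages(table, tolerance=0.01):
    """Return {model: (recomputed_mean, reported_average, matches)}.

    The tolerance absorbs the two-decimal rounding used in the table.
    """
    results = {}
    for model, (benchmarks, reported) in table.items():
        mean = mean_score(benchmarks)
        results[model] = (mean, reported, abs(mean - reported) <= tolerance)
    return results


for model, (mean, reported, ok) in check_averages(scores).items():
    status = "ok" if ok else "mismatch"
    print(f"{model}: recomputed {mean:.3f}, reported {reported} ({status})")
```

A comparison with a tolerance is used instead of `round(mean, 2)` because values that sit exactly on a rounding boundary can round differently in floating point.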