---
license: mit
language:
- en
base_model: Technoculture/MT7Bi-sft
datasets:
- Technoculture/MT7Bi-alpha-dpo-v0.2
---
# MT7Bi-dpo
![image/png](https://cdn-uploads.huggingface.co/production/uploads/63486df1f8f01fcc4b23e97d/l0gsJM9flvOefrWTe6Y5f.png)
[Technoculture/MT7Bi-sft (base)](https://huggingface.co/Technoculture/MT7Bi-sft) + [Technoculture/MT7Bi-alpha-dpo-v0.2 (adapter)](https://huggingface.co/Technoculture/MT7Bi-alpha-dpo-v0.2)
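
A minimal loading sketch, assuming the merged weights are published under the repository id `Technoculture/MT7Bi-dpo` and load with the standard `transformers` causal-LM classes. Neither assumption is confirmed by this card, and the `generate` helper is a hypothetical name introduced only for illustration.

```python
# Assumed repository id; adjust if the weights are hosted elsewhere.
MODEL_ID = "Technoculture/MT7Bi-dpo"


def generate(prompt: str, max_new_tokens: int = 128) -> str:
    """Hypothetical helper: load the model and complete a single prompt."""
    # Imported lazily so the module can be imported without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID)

    # Tokenize the prompt into PyTorch tensors and generate a continuation.
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)

    # Decode the full sequence (prompt plus continuation) back to text.
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate("Briefly explain what a language model is.")` downloads the weights on first use, so expect a multi-gigabyte download for a 7B model.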
# Open LLM Leaderboard
![image/png](https://cdn-uploads.huggingface.co/production/uploads/63486df1f8f01fcc4b23e97d/sNNjkE0Voy7k9cEhTPly9.png)
| Model Name | ARC | HellaSwag | MMLU | TruthfulQA | Winogrande | GSM8K |
| ------------------ | -------- | --------- | ---- | ---------- | ---------- | -------- |
| Orca-2-7b | **78.4** | 76.1 | **53.7** | **52.4** | **74.2** | **47.2** |
| LLAMA-2-7b | 43.2 | **77.1** | 44.4 | 38.7 | 69.5 | 16 |
| MT7Bi-sft | 54.1 | 75.11 | - | 43.08 | 72.14 | 15.54 |
| MT7Bi-dpo | 54.69 | 75.89 | 52.82 | 45.48 | 71.58 | 25.93 |
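
To make the effect of the DPO stage easier to read off the table, the sketch below computes the per-benchmark change from MT7Bi-sft to MT7Bi-dpo. The scores are copied from the table above; the `score_deltas` helper is a hypothetical name, and MMLU is skipped because no MT7Bi-sft score is reported.

```python
# Scores copied from the leaderboard table; None marks a missing entry.
sft = {"ARC": 54.1, "HellaSwag": 75.11, "MMLU": None,
       "TruthfulQA": 43.08, "Winogrande": 72.14, "GSM8K": 15.54}
dpo = {"ARC": 54.69, "HellaSwag": 75.89, "MMLU": 52.82,
       "TruthfulQA": 45.48, "Winogrande": 71.58, "GSM8K": 25.93}


def score_deltas(before: dict, after: dict) -> dict:
    """Return after - before per benchmark, skipping benchmarks with no baseline."""
    return {
        name: round(after[name] - before[name], 2)
        for name in after
        if before.get(name) is not None
    }


deltas = score_deltas(sft, dpo)
for name, delta in deltas.items():
    print(f"{name}: {delta:+.2f}")
```

On these numbers the largest change is GSM8K (+10.39), followed by TruthfulQA (+2.40); Winogrande drops slightly (-0.56).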