PeterV09 committed
Commit a08b314
1 Parent(s): ddeec53

Update README.md

Files changed (1): README.md +2 -2
README.md CHANGED
@@ -8,7 +8,7 @@ language:
 
 <img src="https://huggingface.co/datasets/hkust-nlp/deita-images/resolve/main/logo-final.png" alt="Deita banner" width="800" style="margin-left:'auto' margin-right:'auto' display:'block'"/>
 
-# Model Card for Deita 7B V1.0 SFT (6k)
+# Model Card for Deita 7B V1.0 SFT
 
 Deita is an open-sourced project designed to facilitate **Automatic Data Selection** for instruction tuning in Large Language Models (LLMs).
 Deita 7B V1.0 SFT (6k) is a fine-tuned version of Mistral-7B-v0.1 that was trained on 6k automatically selected lightweight, high-quality alignment SFT data: [Deita 6K V0](https://huggingface.co/datasets/hkust-nlp/deita-6k-v0).
@@ -39,7 +39,7 @@ Deita 7B V1.0 SFT (6k) is a fine-tuned version of Mistral-7B-v0.1 that was train
 | OpenChat-3.5 | C-RLFT | >70K C-RLFT | 7.81 | 88.51 | -- |
 | Starling-7B | C-RLFT + APA | >70K C-RLFT + 183K APA | 8.09 | 91.99 | -- |
 | Random | SFT | 10K SFT | 5.89 | 56.90 | 61.72 |
-| DEITA-7B-v1.0-sft (6K) | SFT | 6K SFT | 7.22 | 80.78 | 64.94 |
+| DEITA-7B-v1.0-sft | SFT | 6K SFT | 7.22 | 80.78 | 64.94 |
 | DEITA-7B-v1.0-sft | SFT | 10K SFT | 7.32 | 81.67 | 64.00 |
 | DEITA-7B-v1.0 | SFT + DPO | 6K SFT + 10K DPO | 7.55 | 90.06 | 69.86 |
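
Since the card this commit edits describes a chat-tuned model, a minimal usage sketch may help readers of the diff. This is an assumption-laden sketch, not part of the commit: the repo id `hkust-nlp/deita-7b-v1.0-sft` is inferred from the model name in the card, and it presumes the tokenizer ships a chat template usable via the standard `transformers` API.

```python
# Minimal sketch (assumed repo id, not stated in this commit):
# load the Deita 7B V1.0 SFT model with transformers and run one chat turn.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "hkust-nlp/deita-7b-v1.0-sft"  # assumed Hugging Face repo id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # Mistral-7B base fits comfortably in bf16
    device_map="auto",
)

# Format a single user turn with the tokenizer's chat template (assumed present).
messages = [{"role": "user", "content": "Explain instruction tuning in one sentence."}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output = model.generate(input_ids, max_new_tokens=128, do_sample=False)
# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True))
```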