AndrewZeng committed 2a209b2 (parent: 7fb2ebc): Update README.md

README.md CHANGED
@@ -25,7 +25,19 @@ Deita Llama1 13B V1.0 SFT is a fine-tuned version of Llama 1 that was trained on
- **Model Family:** Other models and the dataset are found in the [Deita collection](https://huggingface.co/collections/hkust-nlp/deita-6569c198c174808d94cf5bd4).

## Performance
-
+| Model                                          | Align     | Data Size  | MT-Bench | AlpacaEval(%) | OpenLLM (Avg.) |
+|------------------------------------------------|-----------|------------|----------|---------------|----------------|
+| **Proprietary Models**                         |           |            |          |               |                |
+| GPT-4-Turbo                                    | ?         | --         | 9.32     | 97.70         | --             |
+| GPT-4                                          | SFT + PPO | --         | 8.99     | 95.03         | --             |
+| Claude-2                                       | SFT + PPO | --         | 8.06     | 91.36         | --             |
+| GPT-3.5-turbo                                  | SFT + PPO | --         | 7.94     | 89.37         | --             |
+| **Open-sourced Models based on LLaMA-1-13B**   |           |            |          |               |                |
+| LIMA                                           | SFT       | 1K SFT     | 4.29     | 41.98         | 59.82          |
+| WizardLM-13B                                   | SFT       | 70K SFT    | 6.35     | 75.31         | 58.96          |
+| Vicuna-13B-v1.3                                | SFT       | 125K SFT   | 6.39     | 82.11         | 60.01          |
+| Random                                         | SFT       | 10K SFT    | 6.03     | 71.52         | 60.14          |
+| DEITA-LLaMA1-13B-v1.0-sft                      | SFT       | 10K SFT    | 6.60     | 78.01         | 64.27          |

## Input Format
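For readers of the card, a minimal sketch of loading the SFT checkpoint with 🤗 Transformers; the hub id `hkust-nlp/deita-llama1-13b-v1.0-sft` is an assumption inferred from the collection naming, and the prompt is illustrative only:

```python
# Minimal loading sketch for the fine-tuned checkpoint.
# The hub id below is an assumption inferred from the Deita collection
# naming; check the collection page for the exact repository name.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "hkust-nlp/deita-llama1-13b-v1.0-sft"  # assumed repo id

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

prompt = "What makes a good instruction-tuning dataset?"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output_ids = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```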