Xenon1
/

Zenith-7B-dpo

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

gagan3012 commited on Feb 15

Commit

09ac0c5

•

1 Parent(s): a208869

Update README.md

Files changed (1) hide show

README.md +3 -0

README.md CHANGED Viewed

@@ -11,6 +11,9 @@ pipeline_tag: text-generation
 Mistral-7B-v0.1 model fine-tuned on the Ultrafeedback dataset using techinques shown in the paper [Self-Rewarding Language Models](https://arxiv.org/abs/2401.10020).
 ## Instruction format
 In order to leverage instruction fine-tuning, your prompt should be surrounded by `[INST]` and `[/INST]` tokens. The very first instruction should begin with a begin of sentence id. The next instructions should not. The assistant generation will be ended by the end-of-sentence token id.

 Mistral-7B-v0.1 model fine-tuned on the Ultrafeedback dataset using techinques shown in the paper [Self-Rewarding Language Models](https://arxiv.org/abs/2401.10020).
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/60394599033b61166496163b/x50p_gQtQMb0fFVY8MGeq.png)
 ## Instruction format
 In order to leverage instruction fine-tuning, your prompt should be surrounded by `[INST]` and `[/INST]` tokens. The very first instruction should begin with a begin of sentence id. The next instructions should not. The assistant generation will be ended by the end-of-sentence token id.