gagan3012 commited on
Commit
09ac0c5
1 Parent(s): a208869

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -0
README.md CHANGED
@@ -11,6 +11,9 @@ pipeline_tag: text-generation
11
 
12
  Mistral-7B-v0.1 model fine-tuned on the Ultrafeedback dataset using techinques shown in the paper [Self-Rewarding Language Models](https://arxiv.org/abs/2401.10020).
13
 
 
 
 
14
  ## Instruction format
15
 
16
  In order to leverage instruction fine-tuning, your prompt should be surrounded by `[INST]` and `[/INST]` tokens. The very first instruction should begin with a begin of sentence id. The next instructions should not. The assistant generation will be ended by the end-of-sentence token id.
 
11
 
12
  Mistral-7B-v0.1 model fine-tuned on the Ultrafeedback dataset using techinques shown in the paper [Self-Rewarding Language Models](https://arxiv.org/abs/2401.10020).
13
 
14
+
15
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/60394599033b61166496163b/x50p_gQtQMb0fFVY8MGeq.png)
16
+
17
  ## Instruction format
18
 
19
  In order to leverage instruction fine-tuning, your prompt should be surrounded by `[INST]` and `[/INST]` tokens. The very first instruction should begin with a begin of sentence id. The next instructions should not. The assistant generation will be ended by the end-of-sentence token id.