BlouseJury/Mistral-7B-Discord-0.1-DPO


BlouseJury committed on Jan 29

Commit 3fde205 • 1 Parent(s): 8643c31

Update README.md

Files changed (1)

README.md +3 -18


@@ -4,13 +4,10 @@ base_model: BlouseJury/Mistral-7B-Discord-0.1
 tags:
 - generated_from_trainer
 model-index:
-- name: out
+- name: Mistral-7B-Discord-0.1-DPO
   results: []
 ---
-<!-- This model card has been generated automatically according to the information the Trainer had access to. You
-should probably proofread and complete it, then remove this comment. -->
 [<img src="https://raw.githubusercontent.com/OpenAccess-AI-Collective/axolotl/main/image/axolotl-badge-web.png" alt="Built with Axolotl" width="200" height="32"/>](https://github.com/OpenAccess-AI-Collective/axolotl)
 <details><summary>See axolotl config</summary>
@@ -89,24 +86,12 @@ special_tokens:
 </details><br>
-# out
-This model is a fine-tuned version of [BlouseJury/Mistral-7B-Discord-0.1](https://huggingface.co/BlouseJury/Mistral-7B-Discord-0.1) on the None dataset.
+# BlouseJury/Mistral-7B-Discord-0.1-DPO
+This model is a fine-tuned version of [BlouseJury/Mistral-7B-Discord-0.1](https://huggingface.co/BlouseJury/Mistral-7B-Discord-0.1) on the Intel/orca_dpo_pairs dataset.
 It achieves the following results on the evaluation set:
 - Loss: 0.7923
-## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
 ## Training procedure
 ### Training hyperparameters