abhishekchohan committed
Commit a610c8f
1 parent: 750e329
Update README.md
README.md CHANGED
@@ -10,14 +10,14 @@ library_name: transformers
 pipeline_tag: text-generation
 ---

-
+### Mistral-7B-Forest-DPO
 Introducing Mistral-7B-Forest-DPO, a LLM fine-tuned with base model mistralai/Mistral-7B-v0.1, using direct preference optimization.
 This model showcases exceptional prowess across a spectrum of natural language processing (NLP) tasks.

 A mixture of the following datasets was used for fine-tuning.
-
-
-
+Intel/orca_dpo_pairs
+nvidia/HelpSteer
+jondurbin/truthy-dpo-v0.1

 💻 Usage
