StefanKrsteski
commited on
Commit
•
2aa575c
1
Parent(s):
5a5d35c
Update README.md
Browse files
README.md
CHANGED
@@ -5,13 +5,13 @@ datasets:
|
|
5 |
- argilla/ultrafeedback-binarized-preferences-cleaned
|
6 |
- >-
|
7 |
flax-sentence-embeddings/stackexchange_titlebody_best_and_down_voted_answer_jsonl
|
|
|
|
|
8 |
---
|
9 |
|
10 |
# Model Card for Model ID
|
11 |
|
12 |
-
|
13 |
-
|
14 |
-
|
15 |
|
16 |
## Model Details
|
17 |
|
|
|
5 |
- argilla/ultrafeedback-binarized-preferences-cleaned
|
6 |
- >-
|
7 |
flax-sentence-embeddings/stackexchange_titlebody_best_and_down_voted_answer_jsonl
|
8 |
+
language:
|
9 |
+
- en
|
10 |
---
|
11 |
|
12 |
# Model Card for Model ID
|
13 |
|
14 |
+
Phi-3-mini-4k-instruct aligned using trl DPO on three datasets: EPFL-MNLP course (not yet publicly available), stackexchange (STEM only) and ultrafeedback.
|
|
|
|
|
15 |
|
16 |
## Model Details
|
17 |
|