StefanKrsteski committed
Commit 6c61771 • 1 Parent(s): 097e2d8
Update README.md
README.md CHANGED
@@ -20,7 +20,7 @@ language:
 
 ## Intended Use
 
-- **Primary Applications:** This model is designed as part of an AI-Tutor system
+- **Primary Applications:** This model is designed as part of an AI-Tutor system.
 - **Intended Audience:** Educators, students, and developers creating educational AI applications.
 
 ## Model/Data Description
@@ -28,9 +28,9 @@ language:
 ### Training Data
 
 - **Datasets Used:**
-- **Milestone 1 Dataset:** Includes
-- **Stack Exchange Dataset:** Filters content from specific domains within the Stack Exchange network, using upvoted and downvoted answers to form preference pairs. Total entries:
-- **Ultra Feedback:** Utilizes responses rated on criteria like truthfulness and helpfulness to form preference pairs, with a total of
+- **Milestone 1 Dataset:** Includes 1522 unique questions with preference pairs based on the 'overall' rating, totaling 20k+ usable entries after processing.
+- **Stack Exchange Dataset:** Filters content from specific domains within the Stack Exchange network, using upvoted and downvoted answers to form preference pairs. Total entries after preprocessing: 54458.
+- **Ultra Feedback:** Utilizes responses rated on criteria like truthfulness and helpfulness to form preference pairs, with a total of 60917 entries after preprocessing.
 - **Preprocessing Details:** Entries with identical chosen and rejected answers were removed. Datasets were formatted as JSONL where each line represents a JSON object with a "prompt", "chosen", and "rejected" response.
 
 ## Training Procedure
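The "Preprocessing Details" bullet in the README describes two steps: dropping entries whose chosen and rejected answers are identical, and writing the result as JSONL with "prompt", "chosen", and "rejected" keys. The following is a minimal sketch of what such a step might look like, not the authors' actual pipeline. The function names `drop_identical_pairs` and `to_jsonl` are hypothetical, and the exact-string comparison is an assumption; the README does not say whether whitespace or casing was normalized before comparing.

```python
import json
from typing import Iterable, List


def drop_identical_pairs(entries: Iterable[dict]) -> List[dict]:
    """Keep only entries whose chosen and rejected responses differ.

    Assumption: exact string equality decides whether two answers
    are identical (no whitespace or case normalization).
    """
    return [e for e in entries if e["chosen"] != e["rejected"]]


def to_jsonl(entries: Iterable[dict]) -> str:
    """Serialize entries as JSONL: one JSON object per line."""
    lines = []
    for e in entries:
        # Keep only the three fields named in the README.
        record = {
            "prompt": e["prompt"],
            "chosen": e["chosen"],
            "rejected": e["rejected"],
        }
        lines.append(json.dumps(record, ensure_ascii=False))
    if not lines:
        return ""
    return "\n".join(lines) + "\n"


# Illustrative data only; these are not entries from the real datasets.
raw = [
    {"prompt": "What is 2 + 2?", "chosen": "4", "rejected": "5"},
    {"prompt": "Name a prime.", "chosen": "7", "rejected": "7"},
]
kept = drop_identical_pairs(raw)
jsonl_text = to_jsonl(kept)
```

Here the second entry is dropped because its two answers match, so `jsonl_text` holds a single line. Writing `jsonl_text` to a file with a `.jsonl` extension gives the one-object-per-line layout the README describes.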