Update README.md
Browse files
README.md
CHANGED
@@ -40,7 +40,7 @@ The dataset used to train Maximus consists of all the public documents available
|
|
40 |
## Training Details
|
41 |
|
42 |
**Training Data:**
|
43 |
-
The training data includes
|
44 |
|
45 |
**Training Procedure:**
|
46 |
Maximus was trained using supervised learning with cross-entropy loss and the Adam optimizer. The training involved 1 epoch, a batch size of 4, a learning rate of 5.0e-06, and a cosine learning rate scheduler with gradient checkpointing for memory efficiency.
|
|
|
40 |
## Training Details
|
41 |
|
42 |
**Training Data:**
|
43 |
+
The training data includes 67,000 Questions and Answers generated by the [Bonito LLM](https://github.com/BatsResearch/bonito). The dataset is split into 3 sets of data (training, test and validation) to ensure robust model performance.
|
44 |
|
45 |
**Training Procedure:**
|
46 |
Maximus was trained using supervised learning with cross-entropy loss and the Adam optimizer. The training involved 1 epoch, a batch size of 4, a learning rate of 5.0e-06, and a cosine learning rate scheduler with gradient checkpointing for memory efficiency.
|