awels
/

maximusLLM-3b-128k-gguf

Inference Endpoints

Model card Files Files and versions Community

fbeawels commited on Aug 11

Commit

9ca6f79

•

1 Parent(s): cbeedc0

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -40,7 +40,7 @@ The dataset used to train Maximus consists of all the public documents available
 ## Training Details
 **Training Data:**
-The training data includes 16,000 Questions and Answers generated by the [Bonito LLM](https://github.com/BatsResearch/bonito). The dataset is split into 3 sets of data (training, test and validation) to ensure robust model performance.
 **Training Procedure:**
 Maximus was trained using supervised learning with cross-entropy loss and the Adam optimizer. The training involved 1 epoch, a batch size of 4, a learning rate of 5.0e-06, and a cosine learning rate scheduler with gradient checkpointing for memory efficiency.

 ## Training Details
 **Training Data:**
+The training data includes 67,000 Questions and Answers generated by the [Bonito LLM](https://github.com/BatsResearch/bonito). The dataset is split into 3 sets of data (training, test and validation) to ensure robust model performance.
 **Training Procedure:**
 Maximus was trained using supervised learning with cross-entropy loss and the Adam optimizer. The training involved 1 epoch, a batch size of 4, a learning rate of 5.0e-06, and a cosine learning rate scheduler with gradient checkpointing for memory efficiency.