LegalLMs
Collection
XLM-RoBERTa models with continued pretraining on the MultiLegalPile
•
37 items
•
Updated
•
3
This model was trained from scratch on an unknown dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
Training Loss | Epoch | Step | Validation Loss |
---|---|---|---|
0.5892 | 228.0 | 50000 | 0.7659 |
0.4497 | 456.0 | 100000 | 0.7421 |
0.3906 | 684.0 | 150000 | 0.7443 |
0.3906 | 913.0 | 200000 | 0.7328 |