Collection: LegalLMs. XLM-RoBERTa models with continued pretraining on the MultiLegalPile (37 items).
This model was trained from scratch on an unknown dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step   | Validation Loss |
|---------------|-------|--------|-----------------|
| 0.8218        | 8.01  | 50000  | 0.3052          |
| 0.8718        | 16.02 | 100000 | 0.2487          |
| 0.7884        | 24.03 | 150000 | 0.2277          |
| 0.625         | 33.0  | 200000 | 0.2205          |
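
The logged rows can be post-processed to get a rough feel for the run. The sketch below is illustrative only: it copies the four rows from the table, derives an approximate steps-per-epoch figure from each row, and converts validation loss to a perplexity-style number. The conversion assumes the validation loss is a mean per-token cross-entropy in nats, which this card does not state. The names `rows` and `summarize` are hypothetical and not part of any training script.

```python
import math

# (training_loss, epoch, step, validation_loss), copied from the results table
rows = [
    (0.8218, 8.01, 50_000, 0.3052),
    (0.8718, 16.02, 100_000, 0.2487),
    (0.7884, 24.03, 150_000, 0.2277),
    (0.625, 33.0, 200_000, 0.2205),
]


def summarize(rows):
    """Derive per-checkpoint figures from the logged rows."""
    summary = []
    for _train_loss, epoch, step, val_loss in rows:
        summary.append(
            {
                "step": step,
                # Approximate optimizer steps per epoch implied by this row
                "steps_per_epoch": step / epoch,
                # ASSUMPTION: loss is mean cross-entropy in nats
                "val_perplexity": math.exp(val_loss),
            }
        )
    return summary


summary = summarize(rows)
for entry in summary:
    print(
        f"step {entry['step']:>6}: "
        f"~{entry['steps_per_epoch']:.0f} steps/epoch, "
        f"val perplexity ~{entry['val_perplexity']:.3f}"
    )
```

If the loss were reported in a different unit or averaged differently, the perplexity column would not be meaningful; the steps-per-epoch figure depends only on the table values.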