Update README.md
Browse files
README.md
CHANGED
@@ -70,9 +70,9 @@ This comprehensive training helps the model address a broad spectrum of hospital
|
|
70 |
|
71 |
- **Optimizer**: AdamW
|
72 |
- **Learning Rate**: 0.0002 with a cosine learning rate scheduler
|
73 |
-
- **Epochs**:
|
74 |
- **Batch Size**: 4
|
75 |
-
- **Gradient Accumulation Steps**:
|
76 |
- **Maximum Sequence Length**: 8192 tokens
|
77 |
|
78 |
### Environment
|
|
|
70 |
|
71 |
- **Optimizer**: AdamW
|
72 |
- **Learning Rate**: 0.0002 with a cosine learning rate scheduler
|
73 |
+
- **Epochs**: 4
|
74 |
- **Batch Size**: 4
|
75 |
+
- **Gradient Accumulation Steps**: 4
|
76 |
- **Maximum Sequence Length**: 8192 tokens
|
77 |
|
78 |
### Environment
|