Update README.md
Browse files
README.md
CHANGED
@@ -69,7 +69,7 @@ The model was trained on one A100 GPU with the following hyperparameters:
|
|
69 |
| `total_batch_size` | 256 (~130K tokens per step) |
|
70 |
| `num_epochs` | 4 |
|
71 |
|
72 |
-
More details about fine-tuning can be found in the technical report (coming soon).
|
73 |
|
74 |
# Fine-tuning data
|
75 |
|
|
|
69 |
| `total_batch_size` | 256 (~130K tokens per step) |
|
70 |
| `num_epochs` | 4 |
|
71 |
|
72 |
+
More details about fine-tuning can be found in the technical report (coming soon!).
|
73 |
|
74 |
# Fine-tuning data
|
75 |
|