Update README.md
Browse files
README.md
CHANGED
@@ -47,7 +47,7 @@ Intented for research purposes.
|
|
47 |
|
48 |
## Training and evaluation data
|
49 |
|
50 |
-
Data from [oliverguhr/spelling](https://github.com/oliverguhr/spelling), with a "proofread: " prefix
|
51 |
|
52 |
The model was only evaluated on the first 100 test examples only during training.
|
53 |
|
@@ -56,7 +56,7 @@ The model was only evaluated on the first 100 test examples only during training
|
|
56 |
### Training hyperparameters
|
57 |
|
58 |
The following hyperparameters were used during training:
|
59 |
-
- learning_rate: 0.0001
|
60 |
- train_batch_size: 4
|
61 |
- eval_batch_size: 4
|
62 |
- seed: 42
|
|
|
47 |
|
48 |
## Training and evaluation data
|
49 |
|
50 |
+
Data from [oliverguhr/spelling](https://github.com/oliverguhr/spelling), with a "proofread: " prefix added to every example.
|
51 |
|
52 |
The model was only evaluated on the first 100 test examples only during training.
|
53 |
|
|
|
56 |
### Training hyperparameters
|
57 |
|
58 |
The following hyperparameters were used during training:
|
59 |
+
- learning_rate: 0.0001 (probably too high)
|
60 |
- train_batch_size: 4
|
61 |
- eval_batch_size: 4
|
62 |
- seed: 42
|