Philip May
commited on
Commit
·
46a949f
1
Parent(s):
cf598af
Update README.md
Browse files
README.md
CHANGED
@@ -12,3 +12,9 @@ tags:
|
|
12 |
|
13 |
# Evaluation
|
14 |
see <https://github.com/GermanT5/german-t5-eval>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
12 |
|
13 |
# Evaluation
|
14 |
see <https://github.com/GermanT5/german-t5-eval>
|
15 |
+
|
16 |
+
# Tips for training on GPUs
|
17 |
+
This model is too big to fit on a normal 16GB GPU in FP32 mode.
|
18 |
+
For various reasons, T5 models cannot be trained in FP16 mode.
|
19 |
+
However, mixed precision training is not yet supported on many GPUs.
|
20 |
+
For example, it does not work on V100 GPUs. On A100, however, it does.
|