devve1 commited on
Commit
3adaa4f
1 Parent(s): 9a760ec

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -0
README.md CHANGED
@@ -3,6 +3,9 @@ Re-Upload of https://huggingface.co/dunzhang/stella_en_400M_v5 with :
3
  - Max tokens lenght to 512 ( model has been trained on this sequence lenght )
4
  - Padding strategy set to "BatchLongest"
5
 
 
 
6
  Parameters at the end of the file "config.json" has been set manually to false for CPU usage:
7
  ```"unpad_inputs": false, "use_memory_efficient_attention": false```
 
8
  You can turn them back to "true" to enable GPU back again
 
3
  - Max tokens lenght to 512 ( model has been trained on this sequence lenght )
4
  - Padding strategy set to "BatchLongest"
5
 
6
+
7
+
8
  Parameters at the end of the file "config.json" has been set manually to false for CPU usage:
9
  ```"unpad_inputs": false, "use_memory_efficient_attention": false```
10
+
11
  You can turn them back to "true" to enable GPU back again