GameScribes
/

stella_en_400M_v5

Model card Files Files and versions Community

devve1 commited on 23 days ago

Commit

3adaa4f

•

1 Parent(s): 9a760ec

Update README.md

Files changed (1) hide show

README.md +3 -0

README.md CHANGED Viewed

@@ -3,6 +3,9 @@ Re-Upload of https://huggingface.co/dunzhang/stella_en_400M_v5 with :
 - Max tokens lenght to 512 ( model has been trained on this sequence lenght )
 - Padding strategy set to "BatchLongest"
 Parameters at the end of the file "config.json" has been set manually to false for CPU usage:
 ```"unpad_inputs": false, "use_memory_efficient_attention": false```
 You can turn them back to "true" to enable GPU back again

 - Max tokens lenght to 512 ( model has been trained on this sequence lenght )
 - Padding strategy set to "BatchLongest"
 Parameters at the end of the file "config.json" has been set manually to false for CPU usage:
 ```"unpad_inputs": false, "use_memory_efficient_attention": false```
 You can turn them back to "true" to enable GPU back again