Update README.md
satyaalmasian committed · Commit a2ee042 · Parent(s): 6feb3d6

README.md CHANGED
@@ -44,7 +44,7 @@ For Pretraining: 1 million weakly annotated samples from heideltime. The samples
 Fine-tuning: [Tempeval-3](https://www.cs.york.ac.uk/semeval-2013/task1/index.php%3Fid=data.html), Wikiwars, Tweets datasets. For the correct data versions please refer to our [repository](https://github.com/satya77/Transformer_Temporal_Tagger).
 
 # Training procedure
-The model is pre-trained on the weakly labeled data for $3$ epochs on the train set, from publicly available checkpoints on huggingface (`
+The model is pre-trained on the weakly labeled data for $3$ epochs on the train set, from publicly available checkpoints on huggingface (`bert-base-uncased`), with a batch size of 12. We use a learning rate of 5e-05 with an Adam optimizer and linear weight decay.
 Additionally, we use 2000 warmup steps.
 We fine-tune on the 3 benchmark datasets for 8 epochs with 5 different random seeds; this version of the model uses seed=4.
 The batch size and the learning rate are the same as in the pre-training setup, but the warm-up steps are reduced to 100.
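For reference, the hyperparameters described in the added line and the surrounding context can be expressed with the Hugging Face `Trainer` configuration. The sketch below is an illustration under stated assumptions, not the authors' actual training script (see their [repository](https://github.com/satya77/Transformer_Temporal_Tagger) for that): the model head, `num_labels`, and output directory names are placeholders, and the Trainer's default AdamW with a linear learning-rate schedule stands in for the README's "Adam optimizer and linear weight decay".

```python
# Minimal sketch of the described setup, assuming a token-classification head
# and placeholder label count / output directories; the real pipeline lives in
# the authors' repository.
from transformers import (
    AutoModelForTokenClassification,
    AutoTokenizer,
    TrainingArguments,
)

# Start from the public `bert-base-uncased` checkpoint, as stated in the README.
tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForTokenClassification.from_pretrained(
    "bert-base-uncased",
    num_labels=5,  # placeholder: the temporal tag set size depends on the data
)

# Pre-training on the weakly labeled data: 3 epochs, batch size 12,
# learning rate 5e-05, linear schedule, 2000 warmup steps.
pretrain_args = TrainingArguments(
    output_dir="weak_pretraining",
    num_train_epochs=3,
    per_device_train_batch_size=12,
    learning_rate=5e-5,
    lr_scheduler_type="linear",
    warmup_steps=2000,
    seed=4,  # the released checkpoint corresponds to seed 4
)

# Fine-tuning on TempEval-3 / Wikiwars / Tweets: 8 epochs, same batch size and
# learning rate, warmup reduced to 100 steps.
finetune_args = TrainingArguments(
    output_dir="benchmark_finetuning",
    num_train_epochs=8,
    per_device_train_batch_size=12,
    learning_rate=5e-5,
    lr_scheduler_type="linear",
    warmup_steps=100,
    seed=4,
)
```

Each `TrainingArguments` object would then be passed to a `Trainer` together with the corresponding dataset; repeating the fine-tuning stage with `seed` set to each of the 5 values reproduces the seed sweep the README describes.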