eddieman78 commited on
Commit
ee84dbc
1 Parent(s): 28de3f7
Files changed (2) hide show
  1. README.md +4 -4
  2. training_args.bin +1 -1
README.md CHANGED
@@ -1,8 +1,8 @@
1
  ---
2
  license: apache-2.0
 
3
  tags:
4
  - generated_from_trainer
5
- base_model: google/flan-t5-large
6
  model-index:
7
  - name: events-mem-large
8
  results: []
@@ -38,8 +38,8 @@ The following hyperparameters were used during training:
38
  - train_batch_size: 1
39
  - eval_batch_size: 1
40
  - seed: 42
41
- - gradient_accumulation_steps: 4
42
- - total_train_batch_size: 4
43
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
44
  - lr_scheduler_type: linear
45
  - training_steps: 1
@@ -48,7 +48,7 @@ The following hyperparameters were used during training:
48
 
49
  | Training Loss | Epoch | Step | Validation Loss |
50
  |:-------------:|:------:|:----:|:---------------:|
51
- | 0.0 | 0.0008 | 1 | nan |
52
 
53
 
54
  ### Framework versions
 
1
  ---
2
  license: apache-2.0
3
+ base_model: google/flan-t5-large
4
  tags:
5
  - generated_from_trainer
 
6
  model-index:
7
  - name: events-mem-large
8
  results: []
 
38
  - train_batch_size: 1
39
  - eval_batch_size: 1
40
  - seed: 42
41
+ - gradient_accumulation_steps: 2
42
+ - total_train_batch_size: 2
43
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
44
  - lr_scheduler_type: linear
45
  - training_steps: 1
 
48
 
49
  | Training Loss | Epoch | Step | Validation Loss |
50
  |:-------------:|:------:|:----:|:---------------:|
51
+ | 0.0 | 0.0004 | 1 | nan |
52
 
53
 
54
  ### Framework versions
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:0b38974a0249cb06cdd31d5f1d7364281fbbc7f9f9c8af240f8427766d5c3c3a
3
  size 5112
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:97c12a521748071b509cb67a6acdf90e9d784ac9b58519c82226059adff39fef
3
  size 5112