UNIST-Eunchan/Research-Paper-Summarization-Pegasus-x-ArXiv

text2text-generation

Generated from Trainer

UNIST-Eunchan committed on Nov 10, 2023

Commit 333f437 • 1 Parent(s): 510a7fc

Update README.md

Files changed (1)

README.md +14 -10

@@ -92,7 +92,7 @@ Paper Summarization
   ```(python)
   model.generate(input_ids =inputs["input_ids"].to(device),
                               attention_mask=inputs["attention_mask"].to(device),
-                              num_beam_groups=16,diversity_penalty=1.0,num_beams=16,min_length=100,max_length=128*4)
   ```
@@ -109,15 +109,19 @@ We use huggingface-based environment such as datasets, trainer, etc.
 ### Training hyperparameters
 The following hyperparameters were used during training:
-```learning_rate: 1e-05,train_batch_size: 1
-- eval_batch_size: 1
-- seed: 42
-- gradient_accumulation_steps: 64
-- total_train_batch_size: 64
-- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
-- lr_scheduler_type: linear
-- lr_scheduler_warmup_steps: 1586
-- num_epochs: 5```
 ### Framework versions

   ```(python)
   model.generate(input_ids =inputs["input_ids"].to(device),
                               attention_mask=inputs["attention_mask"].to(device),
+                              num_beam_groups=5,diversity_penalty=1.0,num_beams=5,min_length=150,max_length=128*4)
   ```
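The updated call uses diverse (group) beam search: the `num_beams` beams are split into `num_beam_groups` groups, and `diversity_penalty` discourages the groups from repeating each other's tokens. A minimal sketch of the arithmetic behind the new settings follows. `check_group_beam_settings` is a hypothetical helper, not part of `transformers`, and its checks reflect the constraints as commonly documented for group beam search (beam count divisible by group count, positive penalty), so treat them as assumptions rather than the library's exact validation.

```python
def check_group_beam_settings(num_beams, num_beam_groups, diversity_penalty,
                              min_length, max_length):
    """Hypothetical helper: sanity-check settings before calling model.generate."""
    # Assumed constraint: beams are divided evenly among the groups.
    if num_beams % num_beam_groups != 0:
        raise ValueError("num_beams must be divisible by num_beam_groups")
    # Assumed constraint: with several groups, a zero penalty would make them identical.
    if num_beam_groups > 1 and diversity_penalty <= 0.0:
        raise ValueError("diversity_penalty must be positive when using several groups")
    if min_length > max_length:
        raise ValueError("min_length must not exceed max_length")
    return {
        "beams_per_group": num_beams // num_beam_groups,
        "max_length": max_length,
    }


# Values taken from the updated README line in this commit.
new_settings = check_group_beam_settings(
    num_beams=5, num_beam_groups=5, diversity_penalty=1.0,
    min_length=150, max_length=128 * 4,
)
print(new_settings)
```

With 5 beams in 5 groups, each group holds a single beam, and `128*4` caps the summary at 512 tokens. The previous setting (16 beams in 16 groups) satisfies the same checks but searches more than three times as many beams.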
 ### Training hyperparameters
 The following hyperparameters were used during training:
+```(python)
+learning_rate: 1e-05,
+train_batch_size: 1,
+eval_batch_size: 1,
+seed: 42,
+gradient_accumulation_steps: 64,
+total_train_batch_size: 64,
+optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08,
+lr_scheduler_type: linear,
+lr_scheduler_warmup_steps: 1586,
+num_epochs: 5
+```
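As a rough check on the values above: the reported `total_train_batch_size` equals `train_batch_size * gradient_accumulation_steps` if training ran on a single device (an assumption, since the README does not state the device count). A linear schedule with warmup also ramps the learning rate from 0 up to `learning_rate` over the first 1586 optimizer steps. The sketch below illustrates both. `NUM_DEVICES` and `warmup_lr` are hypothetical names introduced here, and only the warmup phase is modelled because the total number of training steps is not given.

```python
# Hyperparameters copied from the README section above.
hparams = {
    "learning_rate": 1e-05,
    "train_batch_size": 1,
    "gradient_accumulation_steps": 64,
    "lr_scheduler_warmup_steps": 1586,
}

NUM_DEVICES = 1  # assumption: the source does not say how many devices were used

# Effective batch size per optimizer step.
effective_batch = (
    hparams["train_batch_size"]
    * hparams["gradient_accumulation_steps"]
    * NUM_DEVICES
)


def warmup_lr(step, base_lr, warmup_steps):
    """Hypothetical helper: learning rate during the linear warmup phase only."""
    if not 0 <= step <= warmup_steps:
        raise ValueError("step is outside the warmup phase")
    return base_lr * (step / warmup_steps)


halfway_lr = warmup_lr(
    hparams["lr_scheduler_warmup_steps"] // 2,
    hparams["learning_rate"],
    hparams["lr_scheduler_warmup_steps"],
)
print(effective_batch, halfway_lr)
```

Under the single-device assumption the effective batch size is 64, matching the reported `total_train_batch_size`, and halfway through warmup (step 793) the learning rate is half of `1e-05`.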
 ### Framework versions