Natet
/

rut5_base_sum_gazeta-finetuned_week_gpt

text2text-generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

Natet commited on Dec 30, 2023

Commit

9060a2e

·

1 Parent(s): eddee13

Training

Files changed (3) hide show

README.md +71 -0
generation_config.json +9 -0
pytorch_model.bin +1 -1

README.md ADDED Viewed

	@@ -0,0 +1,71 @@

+---
+license: apache-2.0
+base_model: IlyaGusev/rut5_base_sum_gazeta
+tags:
+- summarization_3
+- generated_from_trainer
+metrics:
+- rouge
+model-index:
+- name: rut5_base_sum_gazeta-finetuned_week_gpt
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# rut5_base_sum_gazeta-finetuned_week_gpt
+This model is a fine-tuned version of [IlyaGusev/rut5_base_sum_gazeta](https://huggingface.co/IlyaGusev/rut5_base_sum_gazeta) on the None dataset.
+It achieves the following results on the evaluation set:
+- Loss: 1.0665
+- Rouge1: 38.7802
+- Rouge2: 18.8758
+- Rougel: 38.1542
+- Rougelsum: 38.195
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 5.6e-05
+- train_batch_size: 16
+- eval_batch_size: 16
+- seed: 42
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- num_epochs: 8
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum |
+|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|
+| No log        | 1.0   | 555  | 1.1788          | 36.7978 | 17.6912 | 36.1337 | 36.1391   |
+| 1.3896        | 2.0   | 1110 | 1.0992          | 37.9462 | 18.6497 | 37.3932 | 37.4791   |
+| 1.3896        | 3.0   | 1665 | 1.1053          | 38.8205 | 18.8297 | 38.0614 | 38.1843   |
+| 1.1331        | 4.0   | 2220 | 1.1029          | 38.3632 | 18.7051 | 37.654  | 37.7872   |
+| 1.1331        | 5.0   | 2775 | 1.0798          | 39.1371 | 18.8761 | 38.4425 | 38.4942   |
+| 1.0312        | 6.0   | 3330 | 1.0602          | 38.6421 | 18.9015 | 38.0504 | 38.0638   |
+| 1.0312        | 7.0   | 3885 | 1.0650          | 39.2291 | 19.0341 | 38.6098 | 38.6528   |
+| 0.975         | 8.0   | 4440 | 1.0665          | 38.7802 | 18.8758 | 38.1542 | 38.195    |
+### Framework versions
+- Transformers 4.33.0
+- Pytorch 2.0.0
+- Datasets 2.1.0
+- Tokenizers 0.13.3

generation_config.json ADDED Viewed

	@@ -0,0 +1,9 @@

+{
+  "bos_token_id": 2,
+  "decoder_start_token_id": 2,
+  "eos_token_id": 1,
+  "max_length": 200,
+  "num_beams": 5,
+  "pad_token_id": 0,
+  "transformers_version": "4.33.0"
+}

pytorch_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:00a028067e58662b2c191c5c90bb0e9555620c56dc0141a6ff0de7c4e7aa2474
 size 977334453

 version https://git-lfs.github.com/spec/v1
+oid sha256:e389c8ec092a2cfb7b459edd3c94c4b026e98bc049e5704d8b90237db798e33f
 size 977334453