skypro1111 committed on
Commit: bc487eb
Parent: 698b693

Update README.md

Files changed (1)
  1. README.md +2 -2
README.md CHANGED
@@ -18,11 +18,11 @@ widget:
 This model is based on the [facebook/mbart-large-50](https://huggingface.co/facebook/mbart-large-50) architecture, renowned for its effectiveness in translation and text generation tasks across numerous languages.
 
 ## Training Data
-The model was fine-tuned on a subset of 96,780 sentences from the Ubertext dataset, focusing on news content. The verbalized equivalents were created using Google Gemini Pro, providing a rich basis for learning text transformation tasks.
+The model was fine-tuned on a subset of 457,610 sentences from the Ubertext dataset, focusing on news content. The verbalized equivalents were created using Google Gemini Pro, providing a rich basis for learning text transformation tasks.
 Dataset [skypro1111/ubertext-2-news-verbalized](https://huggingface.co/datasets/skypro1111/ubertext-2-news-verbalized)
 
 ## Training Procedure
-The model underwent 70,000 training steps, which is almost 2 epochs, with further training the results degraded.
+The model underwent 410,000 training steps.
 
 ```python
 from transformers import MBartForConditionalGeneration, AutoTokenizer, Trainer, TrainingArguments