Edit model card

Model Card for mT5-base-HunSum-2

The mT5-base-HunSum-2 is a Hungarian abstractive summarization model, which was trained on the SZTAKI-HLT/HunSum-2-abstractive dataset. The model is based on google/mt5-base.

Intended uses & limitations

  • Model type: Text Summarization
  • Language(s) (NLP): Hungarian
  • Resource(s) for more information:

Parameters

  • Batch Size: 12
  • Learning Rate: 5e-5
  • Weight Decay: 0.01
  • Warmup Steps: 3000
  • Epochs: 10
  • no_repeat_ngram_size: 3
  • num_beams: 5
  • early_stopping: False
  • encoder_no_repeat_ngram_size: 4

Results

Metric Value
ROUGE-1 40.06
ROUGE-2 12.67
ROUGE-L 25.93
Downloads last month
7
Inference API
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Dataset used to train SZTAKI-HLT/mT5-base-HunSum-2