asahi417 commited on
Commit
07dccbc
1 Parent(s): 2f8016d

model update

Browse files
Files changed (1) hide show
  1. README.md +52 -62
README.md CHANGED
@@ -33,62 +33,43 @@ model-index:
33
  metrics:
34
  - name: BLEU4
35
  type: bleu4
36
- value: 0.00429117586447163
37
  - name: ROUGE-L
38
  type: rouge-l
39
- value: 0.10079964033809515
40
  - name: METEOR
41
  type: meteor
42
- value: 0.11468160303808986
43
  - name: BERTScore
44
  type: bertscore
45
- value: 0.7990242947308462
46
  - name: MoverScore
47
  type: moverscore
48
- value: 0.54641040830982
49
- - name: QAAlignedF1Score (BERTScore)
50
- type: qa_aligned_f1_score_bertscore
51
- value: 0.9054627676975269
52
- - name: QAAlignedRecall (BERTScore)
53
- type: qa_aligned_recall_bertscore
54
- value: 0.905054414852943
55
- - name: QAAlignedPrecision (BERTScore)
56
- type: qa_aligned_precision_bertscore
57
- value: 0.9058876790018303
58
- - name: QAAlignedF1Score (MoverScore)
59
- type: qa_aligned_f1_score_moverscore
60
- value: 0.6432936856750953
61
- - name: QAAlignedRecall (MoverScore)
62
- type: qa_aligned_recall_moverscore
63
- value: 0.6429189218212675
64
- - name: QAAlignedPrecision (MoverScore)
65
- type: qa_aligned_precision_moverscore
66
- value: 0.6436775977765152
67
  ---
68
 
69
  # Model Card of `lmqg/mt5-small-dequad`
70
- This model is fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) for question generation task on the
71
- [lmqg/qg_dequad](https://huggingface.co/datasets/lmqg/qg_dequad) (dataset_name: default) via [`lmqg`](https://github.com/asahi417/lm-question-generation).
72
 
73
 
74
- Please cite our paper if you use the model ([https://arxiv.org/abs/2210.03992](https://arxiv.org/abs/2210.03992)).
75
-
76
- ```
77
-
78
- @inproceedings{ushio-etal-2022-generative,
79
- title = "{G}enerative {L}anguage {M}odels for {P}aragraph-{L}evel {Q}uestion {G}eneration",
80
- author = "Ushio, Asahi and
81
- Alva-Manchego, Fernando and
82
- Camacho-Collados, Jose",
83
- booktitle = "Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing",
84
- month = dec,
85
- year = "2022",
86
- address = "Abu Dhabi, U.A.E.",
87
- publisher = "Association for Computational Linguistics",
88
- }
89
-
90
- ```
91
-
92
  ### Overview
93
  - **Language model:** [google/mt5-small](https://huggingface.co/google/mt5-small)
94
  - **Language:** en
@@ -100,42 +81,52 @@ Please cite our paper if you use the model ([https://arxiv.org/abs/2210.03992](h
100
  ### Usage
101
  - With [`lmqg`](https://github.com/asahi417/lm-question-generation#lmqg-language-model-for-question-generation-)
102
  ```python
103
-
104
  from lmqg import TransformersQG
 
105
  # initialize model
106
- model = TransformersQG(language='en', model='lmqg/mt5-small-dequad')
 
107
  # model prediction
108
- question = model.generate_q(list_context=["William Turner was an English painter who specialised in watercolour landscapes"], list_answer=["William Turner"])
109
 
110
  ```
111
 
112
  - With `transformers`
113
  ```python
114
-
115
  from transformers import pipeline
116
- # initialize model
117
- pipe = pipeline("text2text-generation", 'lmqg/mt5-small-dequad')
118
- # question generation
119
- question = pipe('<hl> Beyonce <hl> further expanded her acting career, starring as blues singer Etta James in the 2008 musical biopic, Cadillac Records.')
120
 
121
  ```
122
 
123
- ## Evaluation Metrics
124
 
125
 
126
- ### Metrics
127
 
128
- | Dataset | Type | BLEU4 | ROUGE-L | METEOR | BERTScore | MoverScore | Link |
129
- |:--------|:-----|------:|--------:|-------:|----------:|-----------:|-----:|
130
- | [lmqg/qg_dequad](https://huggingface.co/datasets/lmqg/qg_dequad) | default | 0.004 | 0.101 | 0.115 | 0.799 | 0.546 | [link](https://huggingface.co/lmqg/mt5-small-dequad/raw/main/eval/metric.first.sentence.paragraph_answer.question.lmqg_qg_dequad.default.json) |
 
 
 
 
 
 
 
131
 
132
 
133
- ### Metrics (QAG)
134
 
135
- | Dataset | Type | QA Aligned F1 Score (BERTScore) | QA Aligned F1 Score (MoverScore) | Link |
136
- |:--------|:-----|--------------------------------:|---------------------------------:|-----:|
137
- | [lmqg/qg_dequad](https://huggingface.co/datasets/lmqg/qg_dequad) | default | 0.905 | 0.643 | [link](https://huggingface.co/lmqg/mt5-small-dequad/raw/main/eval/metric.first.answer.paragraph.questions_answers.lmqg_qg_dequad.default.json) |
138
-
 
 
 
 
139
 
140
 
141
 
@@ -162,7 +153,6 @@ The full configuration can be found at [fine-tuning config file](https://hugging
162
 
163
  ## Citation
164
  ```
165
-
166
  @inproceedings{ushio-etal-2022-generative,
167
  title = "{G}enerative {L}anguage {M}odels for {P}aragraph-{L}evel {Q}uestion {G}eneration",
168
  author = "Ushio, Asahi and
 
33
  metrics:
34
  - name: BLEU4
35
  type: bleu4
36
+ value: 0.43
37
  - name: ROUGE-L
38
  type: rouge-l
39
+ value: 10.08
40
  - name: METEOR
41
  type: meteor
42
+ value: 11.47
43
  - name: BERTScore
44
  type: bertscore
45
+ value: 79.9
46
  - name: MoverScore
47
  type: moverscore
48
+ value: 54.64
49
+ - name: QAAlignedF1Score (BERTScore) [Gold Answer]
50
+ type: qa_aligned_f1_score_bertscore_gold_answer
51
+ value: 90.55
52
+ - name: QAAlignedRecall (BERTScore) [Gold Answer]
53
+ type: qa_aligned_recall_bertscore_gold_answer
54
+ value: 90.51
55
+ - name: QAAlignedPrecision (BERTScore) [Gold Answer]
56
+ type: qa_aligned_precision_bertscore_gold_answer
57
+ value: 90.59
58
+ - name: QAAlignedF1Score (MoverScore) [Gold Answer]
59
+ type: qa_aligned_f1_score_moverscore_gold_answer
60
+ value: 64.33
61
+ - name: QAAlignedRecall (MoverScore) [Gold Answer]
62
+ type: qa_aligned_recall_moverscore_gold_answer
63
+ value: 64.29
64
+ - name: QAAlignedPrecision (MoverScore) [Gold Answer]
65
+ type: qa_aligned_precision_moverscore_gold_answer
66
+ value: 64.37
67
  ---
68
 
69
  # Model Card of `lmqg/mt5-small-dequad`
70
+ This model is fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) for question generation task on the [lmqg/qg_dequad](https://huggingface.co/datasets/lmqg/qg_dequad) (dataset_name: default) via [`lmqg`](https://github.com/asahi417/lm-question-generation).
 
71
 
72
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
73
  ### Overview
74
  - **Language model:** [google/mt5-small](https://huggingface.co/google/mt5-small)
75
  - **Language:** en
 
81
  ### Usage
82
  - With [`lmqg`](https://github.com/asahi417/lm-question-generation#lmqg-language-model-for-question-generation-)
83
  ```python
 
84
  from lmqg import TransformersQG
85
+
86
  # initialize model
87
+ model = TransformersQG(language="en", model="lmqg/mt5-small-dequad")
88
+
89
  # model prediction
90
+ questions = model.generate_q(list_context="William Turner was an English painter who specialised in watercolour landscapes", list_answer="William Turner")
91
 
92
  ```
93
 
94
  - With `transformers`
95
  ```python
 
96
  from transformers import pipeline
97
+
98
+ pipe = pipeline("text2text-generation", "lmqg/mt5-small-dequad")
99
+ output = pipe("<hl> Beyonce <hl> further expanded her acting career, starring as blues singer Etta James in the 2008 musical biopic, Cadillac Records.")
 
100
 
101
  ```
102
 
103
+ ## Evaluation
104
 
105
 
106
+ - ***Metric (Question Generation)***: [raw metric file](https://huggingface.co/lmqg/mt5-small-dequad/raw/main/eval/metric.first.sentence.paragraph_answer.question.lmqg_qg_dequad.default.json)
107
 
108
+ | | Score | Type | Dataset |
109
+ |:-----------|--------:|:--------|:-----------------------------------------------------------------|
110
+ | BERTScore | 79.9 | default | [lmqg/qg_dequad](https://huggingface.co/datasets/lmqg/qg_dequad) |
111
+ | Bleu_1 | 10.18 | default | [lmqg/qg_dequad](https://huggingface.co/datasets/lmqg/qg_dequad) |
112
+ | Bleu_2 | 4.02 | default | [lmqg/qg_dequad](https://huggingface.co/datasets/lmqg/qg_dequad) |
113
+ | Bleu_3 | 1.6 | default | [lmqg/qg_dequad](https://huggingface.co/datasets/lmqg/qg_dequad) |
114
+ | Bleu_4 | 0.43 | default | [lmqg/qg_dequad](https://huggingface.co/datasets/lmqg/qg_dequad) |
115
+ | METEOR | 11.47 | default | [lmqg/qg_dequad](https://huggingface.co/datasets/lmqg/qg_dequad) |
116
+ | MoverScore | 54.64 | default | [lmqg/qg_dequad](https://huggingface.co/datasets/lmqg/qg_dequad) |
117
+ | ROUGE_L | 10.08 | default | [lmqg/qg_dequad](https://huggingface.co/datasets/lmqg/qg_dequad) |
118
 
119
 
120
+ - ***Metric (Question & Answer Generation)***: QAG metrics are computed with *the gold answer* and generated question on it for this model, as the model cannot provide an answer. [raw metric file](https://huggingface.co/lmqg/mt5-small-dequad/raw/main/eval/metric.first.answer.paragraph.questions_answers.lmqg_qg_dequad.default.json)
121
 
122
+ | | Score | Type | Dataset |
123
+ |:--------------------------------|--------:|:--------|:-----------------------------------------------------------------|
124
+ | QAAlignedF1Score (BERTScore) | 90.55 | default | [lmqg/qg_dequad](https://huggingface.co/datasets/lmqg/qg_dequad) |
125
+ | QAAlignedF1Score (MoverScore) | 64.33 | default | [lmqg/qg_dequad](https://huggingface.co/datasets/lmqg/qg_dequad) |
126
+ | QAAlignedPrecision (BERTScore) | 90.59 | default | [lmqg/qg_dequad](https://huggingface.co/datasets/lmqg/qg_dequad) |
127
+ | QAAlignedPrecision (MoverScore) | 64.37 | default | [lmqg/qg_dequad](https://huggingface.co/datasets/lmqg/qg_dequad) |
128
+ | QAAlignedRecall (BERTScore) | 90.51 | default | [lmqg/qg_dequad](https://huggingface.co/datasets/lmqg/qg_dequad) |
129
+ | QAAlignedRecall (MoverScore) | 64.29 | default | [lmqg/qg_dequad](https://huggingface.co/datasets/lmqg/qg_dequad) |
130
 
131
 
132
 
 
153
 
154
  ## Citation
155
  ```
 
156
  @inproceedings{ushio-etal-2022-generative,
157
  title = "{G}enerative {L}anguage {M}odels for {P}aragraph-{L}evel {Q}uestion {G}eneration",
158
  author = "Ushio, Asahi and