model update
Browse files
README.md
CHANGED
@@ -46,23 +46,38 @@ model-index:
|
|
46 |
- name: MoverScore (Question Generation)
|
47 |
type: moverscore_question_generation
|
48 |
value: 55.88
|
49 |
-
- name:
|
50 |
-
type:
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
51 |
value: 90.66
|
52 |
-
- name: QAAlignedRecall-BERTScore (Question & Answer Generation) [Gold Answer]
|
53 |
-
type:
|
54 |
value: 90.69
|
55 |
-
- name: QAAlignedPrecision-BERTScore (Question & Answer Generation) [Gold Answer]
|
56 |
-
type:
|
57 |
value: 90.64
|
58 |
-
- name: QAAlignedF1Score-MoverScore (Question & Answer Generation) [Gold Answer]
|
59 |
-
type:
|
60 |
value: 65.36
|
61 |
-
- name: QAAlignedRecall-MoverScore (Question & Answer Generation) [Gold Answer]
|
62 |
-
type:
|
63 |
value: 65.36
|
64 |
-
- name: QAAlignedPrecision-MoverScore (Question & Answer Generation) [Gold Answer]
|
65 |
-
type:
|
66 |
value: 65.37
|
67 |
---
|
68 |
|
@@ -117,16 +132,24 @@ output = pipe("Empfangs- und Sendeantenne sollen in ihrer Polarisation übereins
|
|
117 |
| ROUGE_L | 11.19 | default | [lmqg/qg_dequad](https://huggingface.co/datasets/lmqg/qg_dequad) |
|
118 |
|
119 |
|
120 |
-
- ***Metric (Question & Answer Generation)***:
|
121 |
|
122 |
| | Score | Type | Dataset |
|
123 |
|:--------------------------------|--------:|:--------|:-----------------------------------------------------------------|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
124 |
| QAAlignedF1Score (BERTScore) | 90.66 | default | [lmqg/qg_dequad](https://huggingface.co/datasets/lmqg/qg_dequad) |
|
125 |
| QAAlignedF1Score (MoverScore) | 65.36 | default | [lmqg/qg_dequad](https://huggingface.co/datasets/lmqg/qg_dequad) |
|
126 |
| QAAlignedPrecision (BERTScore) | 90.64 | default | [lmqg/qg_dequad](https://huggingface.co/datasets/lmqg/qg_dequad) |
|
127 |
| QAAlignedPrecision (MoverScore) | 65.37 | default | [lmqg/qg_dequad](https://huggingface.co/datasets/lmqg/qg_dequad) |
|
128 |
| QAAlignedRecall (BERTScore) | 90.69 | default | [lmqg/qg_dequad](https://huggingface.co/datasets/lmqg/qg_dequad) |
|
129 |
| QAAlignedRecall (MoverScore) | 65.36 | default | [lmqg/qg_dequad](https://huggingface.co/datasets/lmqg/qg_dequad) |
|
|
|
130 |
|
131 |
|
132 |
|
|
|
46 |
- name: MoverScore (Question Generation)
|
47 |
type: moverscore_question_generation
|
48 |
value: 55.88
|
49 |
+
- name: BLEU4 (Question & Answer Generation (with Gold Answer))
|
50 |
+
type: bleu4_question_answer_generation_with_gold_answer
|
51 |
+
value: 0.09
|
52 |
+
- name: ROUGE-L (Question & Answer Generation (with Gold Answer))
|
53 |
+
type: rouge_l_question_answer_generation_with_gold_answer
|
54 |
+
value: 16.18
|
55 |
+
- name: METEOR (Question & Answer Generation (with Gold Answer))
|
56 |
+
type: meteor_question_answer_generation_with_gold_answer
|
57 |
+
value: 19.96
|
58 |
+
- name: BERTScore (Question & Answer Generation (with Gold Answer))
|
59 |
+
type: bertscore_question_answer_generation_with_gold_answer
|
60 |
+
value: 74.4
|
61 |
+
- name: MoverScore (Question & Answer Generation (with Gold Answer))
|
62 |
+
type: moverscore_question_answer_generation_with_gold_answer
|
63 |
+
value: 52.95
|
64 |
+
- name: QAAlignedF1Score-BERTScore (Question & Answer Generation (with Gold Answer)) [Gold Answer]
|
65 |
+
type: qa_aligned_f1_score_bertscore_question_answer_generation_with_gold_answer_gold_answer
|
66 |
value: 90.66
|
67 |
+
- name: QAAlignedRecall-BERTScore (Question & Answer Generation (with Gold Answer)) [Gold Answer]
|
68 |
+
type: qa_aligned_recall_bertscore_question_answer_generation_with_gold_answer_gold_answer
|
69 |
value: 90.69
|
70 |
+
- name: QAAlignedPrecision-BERTScore (Question & Answer Generation (with Gold Answer)) [Gold Answer]
|
71 |
+
type: qa_aligned_precision_bertscore_question_answer_generation_with_gold_answer_gold_answer
|
72 |
value: 90.64
|
73 |
+
- name: QAAlignedF1Score-MoverScore (Question & Answer Generation (with Gold Answer)) [Gold Answer]
|
74 |
+
type: qa_aligned_f1_score_moverscore_question_answer_generation_with_gold_answer_gold_answer
|
75 |
value: 65.36
|
76 |
+
- name: QAAlignedRecall-MoverScore (Question & Answer Generation (with Gold Answer)) [Gold Answer]
|
77 |
+
type: qa_aligned_recall_moverscore_question_answer_generation_with_gold_answer_gold_answer
|
78 |
value: 65.36
|
79 |
+
- name: QAAlignedPrecision-MoverScore (Question & Answer Generation (with Gold Answer)) [Gold Answer]
|
80 |
+
type: qa_aligned_precision_moverscore_question_answer_generation_with_gold_answer_gold_answer
|
81 |
value: 65.37
|
82 |
---
|
83 |
|
|
|
132 |
| ROUGE_L | 11.19 | default | [lmqg/qg_dequad](https://huggingface.co/datasets/lmqg/qg_dequad) |
|
133 |
|
134 |
|
135 |
+
- ***Metric (Question & Answer Generation, Reference Answer)***: Each question is generated from *the gold answer*. [raw metric file](https://huggingface.co/lmqg/mbart-large-cc25-dequad-qg/raw/main/eval/metric.first.answer.paragraph.questions_answers.lmqg_qg_dequad.default.json)
|
136 |
|
137 |
| | Score | Type | Dataset |
|
138 |
|:--------------------------------|--------:|:--------|:-----------------------------------------------------------------|
|
139 |
+
| BERTScore | 74.4 | default | [lmqg/qg_dequad](https://huggingface.co/datasets/lmqg/qg_dequad) |
|
140 |
+
| Bleu_1 | 14.89 | default | [lmqg/qg_dequad](https://huggingface.co/datasets/lmqg/qg_dequad) |
|
141 |
+
| Bleu_2 | 6.69 | default | [lmqg/qg_dequad](https://huggingface.co/datasets/lmqg/qg_dequad) |
|
142 |
+
| Bleu_3 | 0.64 | default | [lmqg/qg_dequad](https://huggingface.co/datasets/lmqg/qg_dequad) |
|
143 |
+
| Bleu_4 | 0.09 | default | [lmqg/qg_dequad](https://huggingface.co/datasets/lmqg/qg_dequad) |
|
144 |
+
| METEOR | 19.96 | default | [lmqg/qg_dequad](https://huggingface.co/datasets/lmqg/qg_dequad) |
|
145 |
+
| MoverScore | 52.95 | default | [lmqg/qg_dequad](https://huggingface.co/datasets/lmqg/qg_dequad) |
|
146 |
| QAAlignedF1Score (BERTScore) | 90.66 | default | [lmqg/qg_dequad](https://huggingface.co/datasets/lmqg/qg_dequad) |
|
147 |
| QAAlignedF1Score (MoverScore) | 65.36 | default | [lmqg/qg_dequad](https://huggingface.co/datasets/lmqg/qg_dequad) |
|
148 |
| QAAlignedPrecision (BERTScore) | 90.64 | default | [lmqg/qg_dequad](https://huggingface.co/datasets/lmqg/qg_dequad) |
|
149 |
| QAAlignedPrecision (MoverScore) | 65.37 | default | [lmqg/qg_dequad](https://huggingface.co/datasets/lmqg/qg_dequad) |
|
150 |
| QAAlignedRecall (BERTScore) | 90.69 | default | [lmqg/qg_dequad](https://huggingface.co/datasets/lmqg/qg_dequad) |
|
151 |
| QAAlignedRecall (MoverScore) | 65.36 | default | [lmqg/qg_dequad](https://huggingface.co/datasets/lmqg/qg_dequad) |
|
152 |
+
| ROUGE_L | 16.18 | default | [lmqg/qg_dequad](https://huggingface.co/datasets/lmqg/qg_dequad) |
|
153 |
|
154 |
|
155 |
|