lewtun HF staff commited on
Commit
039ef09
1 Parent(s): 2b086ab

Add evaluation results on cnn_dailymail dataset

Browse files

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the [cnn_dailymail](https://huggingface.co/datasets/cnn_dailymail) dataset by

@lewtun

, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-staging-eval-project-3aabac9e-7554868).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=cnn_dailymail).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=cnn_dailymail).

Files changed (1) hide show
  1. README.md +34 -1
README.md CHANGED
@@ -10,7 +10,40 @@ datasets:
10
  - cnn_dailymail
11
  model-index:
12
  - name: mbert-finetune-en-cnn
13
- results: []
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
14
  ---
15
 
16
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
10
  - cnn_dailymail
11
  model-index:
12
  - name: mbert-finetune-en-cnn
13
+ results:
14
+ - task:
15
+ type: summarization
16
+ name: Summarization
17
+ dataset:
18
+ name: cnn_dailymail
19
+ type: cnn_dailymail
20
+ config: 3.0.0
21
+ split: test
22
+ metrics:
23
+ - name: ROUGE-1
24
+ type: rouge
25
+ value: 40.5167
26
+ verified: true
27
+ - name: ROUGE-2
28
+ type: rouge
29
+ value: 17.9329
30
+ verified: true
31
+ - name: ROUGE-L
32
+ type: rouge
33
+ value: 27.6258
34
+ verified: true
35
+ - name: ROUGE-LSUM
36
+ type: rouge
37
+ value: 37.389
38
+ verified: true
39
+ - name: loss
40
+ type: loss
41
+ value: 2.2861809730529785
42
+ verified: true
43
+ - name: gen_len
44
+ type: gen_len
45
+ value: 87.9999
46
+ verified: true
47
  ---
48
 
49
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You