lewtun HF staff commited on
Commit
9e3eb71
1 Parent(s): 55f272a

Add evaluation results on the squad_v2 config of squad_v2

Browse files

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the squad_v2 config of the [squad_v2](https://huggingface.co/datasets/squad_v2) dataset by

@lewtun

, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-staging-eval-project-squad_v2-7c1a5e5f-11505530).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=squad_v2).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=squad_v2).

Files changed (1) hide show
  1. README.md +24 -0
README.md CHANGED
@@ -3,6 +3,30 @@ language: en
3
  datasets:
4
  - squad_v2
5
  license: cc-by-4.0
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
6
  ---
7
 
8
  # roberta-base for QA
 
3
  datasets:
4
  - squad_v2
5
  license: cc-by-4.0
6
+ model-index:
7
+ - name: autoevaluate/roberta-base-squad2
8
+ results:
9
+ - task:
10
+ type: question-answering
11
+ name: Question Answering
12
+ dataset:
13
+ name: squad_v2
14
+ type: squad_v2
15
+ config: squad_v2
16
+ split: validation
17
+ metrics:
18
+ - name: Exact Match
19
+ type: exact_match
20
+ value: 79.9326
21
+ verified: true
22
+ - name: F1
23
+ type: f1
24
+ value: 82.9488
25
+ verified: true
26
+ - name: total
27
+ type: total
28
+ value: 11870
29
+ verified: true
30
  ---
31
 
32
  # roberta-base for QA