lewtun HF staff committed on
Commit
72baa94
1 Parent(s): 2a01be2

Add evaluation results on the autoevaluate--squad-sample config and test split of autoevaluate/squad-sample

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the autoevaluate--squad-sample config and test split of the [autoevaluate/squad-sample](https://huggingface.co/datasets/autoevaluate/squad-sample) dataset by @lewtun, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-staging-eval-project-c3da4aa4-0386-41d1-9c7c-12d712dd287c-126119).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=autoevaluate/squad-sample).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=autoevaluate/squad-sample).
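
For context on the two question-answering metrics this commit records (F1 and Exact Match), here is a minimal sketch of how they are conventionally computed for SQuAD-style answers. It follows the usual normalization steps (lowercasing, stripping punctuation and English articles, collapsing whitespace); this page does not say which implementation the evaluator ran, so treat it as an illustration only. The function names and sample strings are hypothetical.

```python
import re
import string
from collections import Counter


def normalize_answer(text: str) -> str:
    """Lowercase, drop punctuation and articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, reference: str) -> float:
    """1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize_answer(prediction) == normalize_answer(reference))


def f1_score(prediction: str, reference: str) -> float:
    """Token-level F1 between the normalized prediction and reference."""
    pred_tokens = normalize_answer(prediction).split()
    ref_tokens = normalize_answer(reference).split()
    # Multiset intersection counts each shared token at most as often
    # as it appears in both answers.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


# Hypothetical examples, not taken from the dataset.
em = exact_match("The Eiffel Tower!", "eiffel tower")
f1 = f1_score("in Paris France", "Paris")
```

Per-example scores like these are typically averaged over the split and reported on a 0-100 scale, which would be consistent with values such as 76.9929 and 70.0 in the metadata.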

Files changed (1)
  1. README.md +27 -0
README.md CHANGED
@@ -5,6 +5,33 @@ tags:
 datasets:
 - squad
 duplicated_from: autoevaluate/extractive-question-answering
+model-index:
+- name: autoevaluate/extractive-question-answering-not-evaluated
+  results:
+  - task:
+      type: question-answering
+      name: Question Answering
+    dataset:
+      name: autoevaluate/squad-sample
+      type: autoevaluate/squad-sample
+      config: autoevaluate--squad-sample
+      split: test
+    metrics:
+    - type: f1
+      value: 76.9929
+      name: F1
+      verified: true
+      verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiZWY4NmY3MmUxMTI1YjYxOGQ4ZGJiNTExZWE2MGQ3NWY1MGNkMGZiNDA1M2FiNjM2ODBmMmM3NTM0MTIzYTE3OSIsInZlcnNpb24iOjF9.KCqAF5uiJ5MErIARbRt7ZQQZyMCxyQosMzoDk6Z-_-mLBJ3x8DTJYUKbSgd2QvA7tjnWhIq81ba4tJ0D5OvmBg
+    - type: exact_match
+      value: 70.0
+      name: Exact Match
+      verified: true
+      verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiOTRmNTIxNDFhYzBmNDdkOGJmNmQ2YTBjODMxZWZkYzA5Njg2MDVlZjFmZTllMDQ4MzExM2MzMjcwMTc3NmY1MyIsInZlcnNpb24iOjF9.WwsKGMpmxBLVfpwA9qH7f_uVIpECcmkxxUUutHjLnraxZiPG-B_Z7InQ0dWtrtseEIkcEx-Y3u3rnEtzYhNbBA
+    - type: loss
+      value: 1.1083998680114746
+      name: loss
+      verified: true
+      verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiMzUwZGFjMjlhNzMzZGZjZTU5ZDZiYjg0YThiZTA1MDgzOWUxZTQxYmUzNGE2YWYwYTI2YTM1ZDJiNDdmMTcxZCIsInZlcnNpb24iOjF9.udy8AXJOg0hBuhBahq4XcbShp78SDBJz5phkvi4q8EuHEXuBQ1qIxrQbNDpoV2CG8_MBG9EPPtF5d32WiOn2BA
 ---
 
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
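
The `verifyToken` values in the added metadata have the three-segment, dot-separated shape of a JSON Web Token. As a rough sketch, the first two segments can be base64url-decoded with the Python standard library to see what they contain. This only decodes; it does not check the signature, and this page does not state what the `hash` field is computed over. The helper name `decode_segment` is hypothetical.

```python
import base64
import json


def decode_segment(segment: str) -> dict:
    """Base64url-decode one JWT segment and parse it as JSON."""
    # JWT segments omit '=' padding, so restore it before decoding.
    padded = segment + "=" * (-len(segment) % 4)
    return json.loads(base64.urlsafe_b64decode(padded))


# verifyToken of the F1 metric, copied from the diff above.
TOKEN = (
    "eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9."
    "eyJoYXNoIjoiZWY4NmY3MmUxMTI1YjYxOGQ4ZGJiNTExZWE2MGQ3NWY1MGNkMGZiNDA1"
    "M2FiNjM2ODBmMmM3NTM0MTIzYTE3OSIsInZlcnNpb24iOjF9."
    "KCqAF5uiJ5MErIARbRt7ZQQZyMCxyQosMzoDk6Z-_-mLBJ3x8DTJYUKbSgd2QvA7tjnW"
    "hIq81ba4tJ0D5OvmBg"
)

header_b64, payload_b64, signature_b64 = TOKEN.split(".")
header = decode_segment(header_b64)
payload = decode_segment(payload_b64)

print(header)           # {'alg': 'EdDSA', 'typ': 'JWT'}
print(sorted(payload))  # ['hash', 'version']
```

Decoding the token says nothing about whether the metric value is genuine; only whoever holds the matching public key can validate the signature segment.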