autoevaluator HF staff commited on
Commit
cc1d07e
1 Parent(s): 7ed5f44

Add verifyToken field to verify evaluation results are produced by Hugging Face's automatic model evaluator

Browse files

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋! We've added a new `verifyToken` field to your evaluation results to verify that they are produced by the model evaluator. Accept this PR to ensure that your results remain listed as **verified** on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards).

Files changed (1) hide show
  1. README.md +6 -40
README.md CHANGED
@@ -1,39 +1,3 @@
1
- ## Albert Transformer on SQuAD-v2
2
-
3
- Training is done on the [SQuAD_v2](https://rajpurkar.github.io/SQuAD-explorer/) dataset. The model can be accessed via HuggingFace:
4
-
5
- ## Model Specifications
6
-
7
- We have used the following parameters:
8
-
9
- - num_train_epochs=0.25,
10
- - per_device_train_batch_size=5,
11
- - per_device_eval_batch_size=10,
12
- - warmup_steps=100,
13
- - weight_decay=0.01,
14
-
15
- ## Usage Specifications
16
-
17
- ```python
18
- from transformers import AutoTokenizer,AutoModelForQuestionAnswering
19
- from transformers import pipeline
20
- model=AutoModelForQuestionAnswering.from_pretrained('abhilash1910/albert-squad-v2')
21
- tokenizer=AutoTokenizer.from_pretrained('abhilash1910/albert-squad-v2')
22
- nlp_QA=pipeline('question-answering',model=model,tokenizer=tokenizer)
23
- QA_inp={
24
- 'question': 'How many parameters does Bert large have?',
25
- 'context': 'Bert large is really big... it has 24 layers, for a total of 340M parameters.Altogether it is 1.34 GB so expect it to take a couple minutes to download to your Colab instance.'
26
- }
27
- result=nlp_QA(QA_inp)
28
- result
29
- ```
30
- ## Result
31
-
32
- The result is:
33
-
34
-
35
- {'answer': '340M', 'end': 65, 'score': 0.14847151935100555, 'start': 61}
36
-
37
  ---
38
  language:
39
  - en
@@ -52,12 +16,14 @@ model-index:
52
  config: squad_v2
53
  split: validation
54
  metrics:
55
- - name: Exact Match
56
- type: exact_match
57
  value: 23.6563
 
58
  verified: true
59
- - name: F1
60
- type: f1
61
  value: 29.3808
 
62
  verified: true
 
63
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  language:
3
  - en
 
16
  config: squad_v2
17
  split: validation
18
  metrics:
19
+ - type: exact_match
 
20
  value: 23.6563
21
+ name: Exact Match
22
  verified: true
23
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiZTE5ZTM2YzIwZjBhYjM0ZDUyNzBiMjg1YjZhMGJiMGViMjYzYjQ5ZmI4MGFkYmU4YjY1OTNjYzAwZWRlZjIwNSIsInZlcnNpb24iOjF9.jlvV8WRPSPKJm6UdApoh-dXcAOmLPtF5smsHt39RoO4sFzzbH6elUz5yPF5Lt9Yc2YDIl6c8JDsODqMxmsD0Bg
24
+ - type: f1
25
  value: 29.3808
26
+ name: F1
27
  verified: true
28
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiN2ZjYWRlYTI1NDkwYzNhMzM5YTg2NjZmODg0NjNkOGM3YjM2NTlkYjVhZWI0MzlmNjNkMTMxODlkNTY3ODBkMiIsInZlcnNpb24iOjF9.CR1MYeU3uqld9bbI8CLupMtote4WEG9fIq9enwhFJfVpChIT9BGKm86zaPmXHg0yBaNHgkMaEt_a-DaIdiEwAg
29
  ---