File size: 2,850 Bytes
2a5ec4b
 
 
 
 
 
 
 
 
 
 
19c0418
 
 
 
2a5ec4b
 
 
7d34c70
66322ce
 
 
2a5ec4b
 
 
 
 
 
 
 
 
 
 
 
 
d4b7aa0
2a5ec4b
b35e3c5
 
19c0418
b35e3c5
 
38a45fb
2a5ec4b
 
 
 
fad1ca4
2a5ec4b
fad1ca4
2a5ec4b
 
 
 
c4802fc
 
 
3458e28
99ae297
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
---
language: en
tags:
- t5
datasets:
- squad
license: mit
---

# Question Generation Model

## Github

https://github.com/Seoneun/T5-Question-Generation

## Fine-tuning Dataset

SQuAD 1.1

| Train Data | Dev Data | Test Data |
| ------ | ------ | ------ |
| 75,722 | 10,570 | 11,877 |

## Demo

https://huggingface.co/Sehong/t5-large-QuestionGeneration

## How to use

```python
import torch
from transformers import PreTrainedTokenizerFast
from transformers import T5ForConditionalGeneration

tokenizer = PreTrainedTokenizerFast.from_pretrained('Sehong/t5-large-QuestionGeneration')
model = T5ForConditionalGeneration.from_pretrained('Sehong/t5-large-QuestionGeneration')

# tokenized
'''
text = "answer:Saint Bern ##ade ##tte So ##ubi ##rous content:Architectural ##ly , the school has a Catholic character . At ##op the Main Building ' s gold dome is a golden statue of the Virgin Mary . Immediately in front of the Main Building and facing it , is a copper statue of Christ with arms up ##rai ##sed with the legend "" V ##eni ##te Ad Me O ##m ##nes "" . Next to the Main Building is the Basilica of the Sacred Heart . Immediately behind the b ##asi ##lica is the G ##rot ##to , a Marian place of prayer and reflection . It is a replica of the g ##rot ##to at Lou ##rdes , France where the Virgin Mary reputed ##ly appeared to Saint Bern ##ade ##tte So ##ubi ##rous in 1858 . At the end of the main drive ( and in a direct line that connects through 3 statues and the Gold Dome ) , is a simple , modern stone statue of Mary ."
'''

text = "answer:Saint Bernadette Soubirous content:Architecturally , the school has a Catholic character . Atop the Main Building ' s gold dome is a golden statue of the Virgin Mary . Immediately in front of the Main Building and facing it , is a copper statue of Christ with arms upraised with the legend "" Venite Ad Me Omnes "" . Next to the Main Building is the Basilica of the Sacred Heart . Immediately behind the basilica is the Grotto , a Marian place of prayer and reflection . It is a replica of the grotto at Lourdes , France where the Virgin Mary reputedly appeared to Saint Bernadette Soubirous in 1858 . At the end of the main drive ( and in a direct line that connects through 3 statues and the Gold Dome ) , is a simple , modern stone statue of Mary ."

raw_input_ids = tokenizer.encode(text)
input_ids = [tokenizer.bos_token_id] + raw_input_ids + [tokenizer.eos_token_id]

question_ids = model.generate(torch.tensor([input_ids]))

decode = tokenizer.decode(question_ids.squeeze().tolist(), skip_special_tokens=True)

decode = decode.replace(' # # ', '').replace('  ', ' ').replace(' ##', '')

print(decode)
```

## Evalutation

| BLEU-1 | BLEU-2 | BLEU-3 | BLEU-4 | METEOR | ROUGE-L |
| ------ | ------ | ------ | ------ | ------ | ------- |
| 51.333 | 36.742 | 28.218 | 22.289 | 26.126 | 51.069  |