macavaney commited on
Commit
f304d0d
·
verified ·
1 Parent(s): 6b672fc

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +17 -0
README.md ADDED
@@ -0,0 +1,17 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ For use with the [`pyterrier-quality`](https://github.com/terrierteam/pyterrier-quality) package.
2
+
3
+ A version of mt5-small trained as a passage quality estimation model using the approach described in [this paper](https://arxiv.org/pdf/2407.12170), over the following datasets:
4
+ msmarco-passage, mmarco/de, mmarco/es, mmarco/fr, mmarco/id, mmarco/it, mmarco/pt, mmarco/ru, mmarco/v2/ar, mmarco/v2/de, mmarco/v2/dt, mmarco/v2/es, mmarco/v2/fr, mmarco/v2/hi, mmarco/v2/id, mmarco/v2/it, mmarco/v2/ja, mmarco/v2/pt, mmarco/v2/ru, mmarco/v2/vi, mmarco/v2/zh, mmarco/zh, neumarco/fa, neumarco/ru, neumarco/zh
5
+
6
+ ```python
7
+ >>> from pyterrier_quality import QualT5
8
+ >>> qt5 = QualT5('pyterrier-quality/mqt5-small')
9
+ >>> qt5([
10
+ ... {'docno': '0', 'text': 'bla bla bla'},
11
+ ... {'docno': '0', 'text': 'The presence of communication amid scientific minds was equally important to the success of the Manhattan Project as scientific intellect was. The only cloud hanging over the impressive achievement of the atomic researchers and engineers is what their success truly meant; hundreds of thousands of innocent lives obliterated.'},
12
+ ... ])
13
+ docno text quality
14
+ 0 0 bla bla bla -1.406250
15
+ 1 0 The presence of communication amid scientific ... -0.828125
16
+ >>> # A larger quality score means higher quality
17
+ ```