Dang Phuong Nam
commited on
Commit
•
82dc9ae
1
Parent(s):
ce67b8a
Update README.md
Browse files
README.md
CHANGED
@@ -115,13 +115,12 @@ Train data should be a json file, where each line is a dict like this:
|
|
115 |
{"query": str, "pos": List[str], "neg": List[str]}
|
116 |
```
|
117 |
|
118 |
-
`query` is the query, and `pos` is a list of positive texts, `neg` is a list of negative texts
|
119 |
-
|
120 |
-
entire corpus as the negatives.
|
121 |
|
122 |
## Performance
|
123 |
|
124 |
-
|
125 |
the [MS MMarco Passage Reranking - Vi - Dev](https://huggingface.co/datasets/unicamp-dl/mmarco) dataset.
|
126 |
|
127 |
| Model-Name | NDCG@3 | MRR@3 | NDCG@5 | MRR@5 | NDCG@10 | MRR@10 | Docs / Sec |
|
|
|
115 |
{"query": str, "pos": List[str], "neg": List[str]}
|
116 |
```
|
117 |
|
118 |
+
`query` is the query, and `pos` is a list of positive texts, `neg` is a list of negative texts. If you have no negative
|
119 |
+
texts for a query, you can random sample some from the entire corpus as the negatives.
|
|
|
120 |
|
121 |
## Performance
|
122 |
|
123 |
+
Below is a comparision table of the results we achieved compared to some other pre-trained Cross-Encoders on
|
124 |
the [MS MMarco Passage Reranking - Vi - Dev](https://huggingface.co/datasets/unicamp-dl/mmarco) dataset.
|
125 |
|
126 |
| Model-Name | NDCG@3 | MRR@3 | NDCG@5 | MRR@5 | NDCG@10 | MRR@10 | Docs / Sec |
|