---
license: mit
---

DPR model trained for NeuCLIR, based on an XLMR-Large language model pretrained with C3 and fine-tuned with Multilingual Translate-Train (MTT) on MS-MARCO English queries paired with documents translated into Chinese, Persian, and Russian. The translations are available as [neuMARCO](https://ir-datasets.com/neumarco.html) on `ir-datasets`.

Please cite the following papers if you use this model:

```bibtex
@inproceedings{sigir2022c3,
  author = {Eugene Yang and Suraj Nair and Ramraj Chandradevan and Rebecca Iglesias-Flores and Douglas W. Oard},
  title = {C3: Continued Pretraining with Contrastive Weak Supervision for Cross Language Ad-Hoc Retrieval},
  booktitle = {Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR) (Short Paper)},
  year = {2022},
  url = {https://arxiv.org/abs/2204.11989}
}

@inproceedings{ecir2023mlir,
  title = {Neural Approaches to Multilingual Information Retrieval},
  author = {Dawn Lawrie and Eugene Yang and Douglas W Oard and James Mayfield},
  booktitle = {Proceedings of the 45th European Conference on Information Retrieval (ECIR)},
  year = {2023},
  url = {https://arxiv.org/abs/2209.01335}
}
```
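
For readers new to DPR-style retrieval, here is a minimal sketch of how a dense retriever ranks documents: the query and each document are encoded into vectors, and documents are ordered by inner product with the query. The vectors below are hand-written toy values and the document IDs are hypothetical. In practice the embeddings would come from this model's encoder, and this is not the authors' inference code.

```python
def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def rank(query_vec, doc_vecs):
    """Return (doc_id, score) pairs sorted by descending inner product."""
    scores = {doc_id: dot(query_vec, vec) for doc_id, vec in doc_vecs.items()}
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Toy 3-dimensional embeddings (assumed values, for illustration only).
# One English query vector and one document vector per collection language.
query_vec = [0.2, 0.9, 0.1]
doc_vecs = {
    "zh_doc": [0.1, 0.8, 0.0],
    "fa_doc": [0.9, 0.1, 0.2],
    "ru_doc": [0.3, 0.5, 0.4],
}

ranking = rank(query_vec, doc_vecs)
for doc_id, score in ranking:
    print(f"{doc_id}\t{score:.2f}")
```

Because all documents share one embedding space regardless of language, a single ranked list can mix Chinese, Persian, and Russian documents for an English query.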