What's the instruction you added for each task in training?

#3
by nachtsky1077 - opened

You mentioned that you added different instructions for tasks like retrieval and reranking, could you help provide those instructions? Thanks!

HITsz-Text Machine Group org

You mentioned that you added different instructions for tasks like retrieval and reranking, could you help provide those instructions? Thanks!

For retrieval and reranking, we use the following instruction:

Instruct: Given a query, retrieve documents that answer the query. \n Query: {query}

For other tasks (including evaluation and potential training), please refer to the instructions provided on this previous issue.
We will release the instructions and complete code for testing in the future, making it more convenient for reference at that time.

What if the query is multilingual? Do you use English instructions?

HITsz-Text Machine Group org

What if the query is multilingual? Do you use English instructions?

Yes, instructions for multilingual tasks are generally provided in English, as it is more universally understood.
You can also specify the language requirements in the instruction to help the model better recognize the language.
Of course, I have also seen developers use other languages directly as instructions (such as Russian), which could also be worth trying.

HITsz-Text Machine Group org

News: For the complete instructions on the MTEB evaluation, please refer to our technical report (in the appendix) and code.

Sign up or log in to comment