Code to reproduce MTEB results

#4
by nv-bschifferer - opened

Hello, can you share the code to reproduce some of the MTEB results with bge-en-icl?

I also wonder how the examples added to the instruction prompt are selected for each individual dataset. Are they hand-selected?

Beijing Academy of Artificial Intelligence org

Thanks a lot. Are the same examples used for every query in a given dataset? Can you share some insight into how they were selected? Were they chosen at random?

Beijing Academy of Artificial Intelligence org

@nv-bschifferer, yes, we use the same examples for all queries in the same dataset. For tasks that have a training split in the MTEB HF repo, we randomly sample a few examples from the training split. If there is no training split, we use ChatGPT to generate some examples for the task.
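For readers who want to try this, here is a minimal sketch of the sampling described above. It is not the authors' evaluation code. The toy `train_split`, the helper names (`sample_examples`, `build_prefix`, `build_query`), the seed, and the task string are all hypothetical, and the `<instruct>`/`<query>`/`<response>` template is an assumption that should be checked against the bge-en-icl model card.

```python
import random

def sample_examples(train_split, k=2, seed=0):
    """Pick k (query, response) pairs once per dataset, reproducibly."""
    rng = random.Random(seed)  # fixed seed (assumption) so the choice is repeatable
    return rng.sample(train_split, k)

def build_prefix(task, examples):
    """Format the sampled pairs as few-shot blocks (template is an assumption)."""
    blocks = [
        f"<instruct>{task}\n<query>{q}\n<response>{r}" for q, r in examples
    ]
    return "\n\n".join(blocks)

def build_query(task, prefix, query):
    """Prepend the same few-shot prefix to a single evaluation query."""
    return f"{prefix}\n\n<instruct>{task}\n<query>{query}"

# Hypothetical stand-in for a task's training split.
train_split = [
    ("what is a transformer", "A transformer is an attention-based neural network."),
    ("define recall", "Recall is the fraction of relevant items that are retrieved."),
    ("what is an embedding", "An embedding is a dense vector representation of an input."),
    ("what is bm25", "BM25 is a lexical ranking function used in search."),
]

task = "Given a web search query, retrieve relevant passages that answer the query."
examples = sample_examples(train_split, k=2, seed=0)
prefix = build_prefix(task, examples)

# Every query in the dataset shares the same sampled examples.
queries = ["how do embeddings work", "what is MTEB"]
prompts = [build_query(task, prefix, q) for q in queries]
```

The point of sampling once and reusing `prefix` is that every query in a dataset sees identical in-context examples, which matches the answer above. For tasks without a training split, `examples` would instead be a short hand-written or generated list.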

@Shitao: do you use few-shot prompts during training or fine-tuning, or only during evaluation?
