sections/finetuning/data.md · flax-community/Multilingual-VQA at 8e7fd4d590ae4f79f0041d43b72a58cf191c2e84

For fine-tuning, we use the VQA 2.0 dataset - particularly, the train and validation sets. We translate all the questions into the four languages specified above using language-specific MarianMT models. This is because MarianMT models return better labels and are faster, hence, are better for fine-tuning. We get 4x the number of examples in each subset.