Training on the test task models Collection Models fine-tuned for multiple choice question answering (mc) and mathematical reasoning (gsm8k). https://arxiv.org/abs/2407.07890 • 56 items • Updated Sep 14