U-MATH: A University-Level Benchmark for Evaluating Mathematical Skills in LLMs Paper • 2412.03205 • Published 11 days ago • 14
U-MATH and μ-MATH - University-level math evaluation Collection Paper: A UNIVERSITY-LEVEL BENCHMARK FOR EVALUATING MATHEMATICAL SKILLS IN LLMS • 3 items • Updated 3 days ago • 15
k4black/t5-small-e-snli-generation-explanation_only-selected-b64 Text2Text Generation • Updated Apr 9, 2023 • 16
k4black/google-flan-t5-small-e-snli-generation-explanation_use_prompt_label-selected-b64 Text2Text Generation • Updated Apr 9, 2023 • 31
k4black/google-flan-t5-small-e-snli-generation-label_and_explanation-selected-b64 Text2Text Generation • Updated Apr 9, 2023 • 7
k4black/roberta-large-e-snli-classification-nli_explanation-base-b16 Text Classification • Updated Apr 8, 2023 • 7
k4black/roberta-base-e-snli-classification-nli_explanation-base Text Classification • Updated Apr 8, 2023 • 9
k4black/Salesforce-codet5-small-java-small-selected-wo-tokens Text2Text Generation • Updated Mar 27, 2023 • 13