sumukshashidhar-testing/yourbench_y1_single_shot_questions_v2x_answers_reformatted_2 Viewer • Updated 28 days ago • 2.61k • 42
sumukshashidhar-testing/yourbench_y1_single_shot_questions_v2x_answers_judged Viewer • Updated 28 days ago • 2.61k • 65
sumukshashidhar-testing/yourbench_y1_single_shot_questions_v2x_answers_reformatted Viewer • Updated 28 days ago • 2.61k • 53
sumukshashidhar-testing/yourbench_y1_single_shot_questions_v2x_answers Viewer • Updated 29 days ago • 5.22k • 46
sumukshashidhar-testing/yourbench_y1_single_shot_questions_v2 Viewer • Updated about 1 month ago • 2.61k • 52
sumukshashidhar-testing/yourbench_y1_singleshot_answers_reformatted Viewer • Updated about 1 month ago • 3.49k • 42
sumukshashidhar-testing/yourbench_y1_single_shot_questions Viewer • Updated Dec 13, 2024 • 2.93k • 40
Democratizing LLMs: An Exploration of Cost-Performance Trade-offs in Self-Refined Open-Source Models Paper • 2310.07611 • Published Oct 11, 2023 • 2