Outcome-Refining Process Supervision for Code Generation • Paper • arXiv:2412.15118 • Published 9 days ago
AutoSurvey: Large Language Models Can Automatically Write Surveys • Paper • arXiv:2406.10252 • Published Jun 10
PandaLM: An Automatic Evaluation Benchmark for LLM Instruction Tuning Optimization • Paper • arXiv:2306.05087 • Published Jun 8, 2023