MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications • Paper • 2409.07314 • Published Sep 11
Beyond Metrics: A Critical Analysis of the Variability in Large Language Model Evaluation Frameworks • Paper • 2407.21072 • Published Jul 29