MedHallu: A Comprehensive Benchmark for Detecting Medical Hallucinations in Large Language Models Paper • 2502.14302 • Published 20 days ago • 9
TrustLLM: Trustworthiness in Large Language Models Paper • 2401.05561 • Published Jan 10, 2024 • 69