What benchmark dataset is used for testing hallucination?

#2
by zhiminy - opened

hi, it's this: https://huggingface.co/spaces/vectara/Hallucination-evaluation-leaderboard

Thanks for your reply. So the CNN/Daily Mail dataset is indeed the one used for benchmarking hallucination, right? Why not mention that somewhere in the documentation?
