What benchmark dataset is used for testing hallucination?

#2
by zhiminy - opened

1710805144355.png

hi, it's this: https://huggingface.co/spaces/vectara/Hallucination-evaluation-leaderboard

Thanks for your reply. Thus, it is indeed the CNN DM dataset used for benchmarking the hallucination, right? Why not mention it somewhere in the documentation?

Sign up or log in to comment