guerra-llm-ai-leaderboard

What benchmark dataset is used for testing hallucination?

by zhiminy - opened Mar 18, 2024

Discussion

zhiminy

Mar 18, 2024 • edited Mar 18, 2024

luisrguerra

Owner Mar 19, 2024

hi, it's this: https://huggingface.co/spaces/vectara/Hallucination-evaluation-leaderboard

zhiminy

Mar 19, 2024 • edited Mar 19, 2024

hi, it's this: https://huggingface.co/spaces/vectara/Hallucination-evaluation-leaderboard

Thanks for your reply. So the CNN/Daily Mail (CNN/DM) dataset is indeed what is used to benchmark hallucination, right? Why not mention it somewhere in the documentation?
