- Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-OASIS
  Paper • 2411.19655 • Published • 20
- Babelscape/LLM-Oasis_claim_extraction
  Viewer • Updated • 81.3k • 28 • 6
- Babelscape/LLM-Oasis_claim_verification
  Viewer • Updated • 2.66k • 8 • 5
- Babelscape/LLM-Oasis_e2e_factuality_evaluation
  Viewer • Updated • 1.71k • 17 • 5
AI & ML interests
Babelscape is a deep tech company founded in 2016 focused on multilingual Natural Language Processing.
FENICE is a metric for summarization factuality, with a focus on interpretability. It leverages natural language inference (NLI) and claim extraction to assess factuality.
- Babelscape/t5-base-summarization-claim-extractor
  0.2B • Updated • 221k • 12
- Babelscape/story-summeval
  Viewer • Updated • 319 • 15 • 8
- Babelscape/FENICE
  Updated • 6
- FENICE: Factuality Evaluation of summarization based on Natural language Inference and Claim Extraction
  Paper • 2403.02270 • Published • 3
Word Sense Linking is the task of identifying spans of text and disambiguating them to their most suitable senses in a reference inventory.