OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems Paper • 2402.14008 • Published Feb 21, 2024
Teaching Large Language Models to Maintain Contextual Faithfulness via Synthetic Tasks and Reinforcement Learning Paper • 2505.16483 • Published May 22, 2025 • 10
GLTW: Joint Improved Graph Transformer and LLM via Three-Word Language for Knowledge Graph Completion Paper • 2502.11471 • Published Feb 17, 2025 • 1
C-Eval: A Multi-Level Multi-Discipline Chinese Evaluation Suite for Foundation Models Paper • 2305.08322 • Published May 15, 2023
FaithLens: Detecting and Explaining Faithfulness Hallucination Paper • 2512.20182 • Published Dec 23, 2025 • 9
InFi-Check: Interpretable and Fine-Grained Fact-Checking of LLMs Paper • 2601.06666 • Published 22 days ago • 1