ChartQAPro: A More Diverse and Challenging Benchmark for Chart Question Answering Paper β’ 2504.05506 β’ Published Apr 7 β’ 24
AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding Paper β’ 2502.01341 β’ Published Feb 3 β’ 39
BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks Paper β’ 2412.04626 β’ Published Dec 5, 2024 β’ 14
Chart-to-Text: A Large-Scale Benchmark for Chart Summarization Paper β’ 2203.06486 β’ Published Mar 12, 2022
ChartQA: A Benchmark for Question Answering about Charts with Visual and Logical Reasoning Paper β’ 2203.10244 β’ Published Mar 19, 2022
ChartGemma: Visual Instruction-tuning for Chart Reasoning in the Wild Paper β’ 2407.04172 β’ Published Jul 4, 2024 β’ 27
UniChart: A Universal Vision-language Pretrained Model for Chart Comprehension and Reasoning Paper β’ 2305.14761 β’ Published May 24, 2023
Do LLMs Work on Charts? Designing Few-Shot Prompts for Chart Question Answering and Summarization Paper β’ 2312.10610 β’ Published Dec 17, 2023 β’ 1
ChartInstruct: Instruction Tuning for Chart Comprehension and Reasoning Paper β’ 2403.09028 β’ Published Mar 14, 2024