This is a flan-t5-based model pre-trained on VG scene graph parsing dataset first and then fine-tuned on FACTUAL scene graph parsing dataset. See model details from 'https://github.com/zhuang-li/FACTUAL/tree/main'.
This is a flan-t5-based model pre-trained on VG scene graph parsing dataset first and then fine-tuned on FACTUAL scene graph parsing dataset. See model details from 'https://github.com/zhuang-li/FACTUAL/tree/main'.