MarkupLM

Multimodal (text + markup language) pre-training for Document AI

Introduction

MarkupLM is a simple but effective multimodal pre-training method of text and markup language for visually-rich document understanding and information extraction tasks, such as webpage QA and webpage information extraction. MarkupLM achieves state-of-the-art (SOTA) results on multiple datasets. For more details, please refer to our paper:

MarkupLM: Pre-training of Text and Markup Language for Visually-rich Document Understanding. Junlong Li, Yiheng Xu, Lei Cui, Furu Wei. ACL 2022.
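To make "text and markup language" concrete: as described in the paper, the markup signal is carried by the XPath of each text node. The sketch below uses only the Python standard library to pair each text node in an HTML string with an XPath-like string. The names `XPathTextExtractor` and `extract_text_with_xpaths` are hypothetical, and the always-indexed path format (`/li[2]`) is a simplification for illustration, not the model's actual preprocessing.

```python
from html.parser import HTMLParser

# Elements that never have a closing tag, so they must not be pushed on the stack.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class XPathTextExtractor(HTMLParser):
    """Collect (text, xpath) pairs for every non-empty text node."""

    def __init__(self):
        super().__init__()
        self.stack = []       # (tag, sibling index) for each open element
        self.counters = [{}]  # per-depth count of children seen, keyed by tag
        self.pairs = []

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        counts = self.counters[-1]
        counts[tag] = counts.get(tag, 0) + 1
        self.stack.append((tag, counts[tag]))
        self.counters.append({})

    def handle_endtag(self, tag):
        # Pop only on a matching close, so stray end tags are ignored.
        if self.stack and self.stack[-1][0] == tag:
            self.stack.pop()
            self.counters.pop()

    def handle_data(self, data):
        text = data.strip()
        if text:
            xpath = "".join(f"/{t}[{i}]" for t, i in self.stack)
            self.pairs.append((text, xpath))


def extract_text_with_xpaths(html):
    """Return a list of (text, xpath) tuples in document order."""
    parser = XPathTextExtractor()
    parser.feed(html)
    parser.close()
    return parser.pairs
```

For example, `extract_text_with_xpaths("<html><body><ul><li>a</li><li>b</li></ul></body></html>")` pairs `"a"` with `/html[1]/body[1]/ul[1]/li[1]` and `"b"` with `/html[1]/body[1]/ul[1]/li[2]`, so two text nodes with similar content remain distinguishable by their position in the markup tree.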

Usage

For usage, please refer to the docs and demo notebooks.
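As a starting point, here is a minimal sketch of loading this checkpoint with the Hugging Face `transformers` library and encoding a raw HTML string. It assumes `transformers`, `torch`, and `beautifulsoup4` are installed and that the checkpoint can be downloaded. The helper name `embed_html` is hypothetical and not part of any library.

```python
DEFAULT_CHECKPOINT = "microsoft/markuplm-base"


def embed_html(html_string, checkpoint=DEFAULT_CHECKPOINT):
    """Return MarkupLM's last hidden states for a single HTML string."""
    # Imported lazily so this module loads even without the heavy dependencies.
    import torch
    from transformers import MarkupLMModel, MarkupLMProcessor

    # The processor parses the HTML, extracts text nodes with their XPaths,
    # and tokenizes them into model inputs.
    processor = MarkupLMProcessor.from_pretrained(checkpoint)
    model = MarkupLMModel.from_pretrained(checkpoint)
    model.eval()

    encoding = processor(html_string, return_tensors="pt")
    with torch.no_grad():
        outputs = model(**encoding)
    return outputs.last_hidden_state
```

Calling `embed_html("<html><head><title>Page Title</title></head></html>")` should return a tensor of shape `(1, sequence_length, hidden_size)`. This is the bare encoder without a task head; for webpage QA or information extraction, the docs and demo notebooks are the place to look for task-specific models.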


Paper

MarkupLM: Pre-training of Text and Markup Language for Visually-rich Document Understanding (arXiv:2110.08518, published Oct 16, 2021)