Update README.md
Browse files
README.md
CHANGED
@@ -51,6 +51,11 @@ It achieves the following results on the evaluation set:
|
|
51 |
|
52 |
## Model description
|
53 |
|
|
|
|
|
|
|
|
|
|
|
54 |
More information needed
|
55 |
|
56 |
## Intended uses & limitations
|
|
|
51 |
|
52 |
## Model description
|
53 |
|
54 |
+
LayoutLMv3 is a pre-trained multimodal Transformer for Document AI with unified text and image masking. The simple unified architecture and training objectives make LayoutLMv3 a general-purpose pre-trained model. For example, LayoutLMv3 can be fine-tuned for both text-centric tasks, including form understanding, receipt understanding, and document visual question answering, and image-centric tasks such as document image classification and document layout analysis.
|
55 |
+
|
56 |
+
[LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking](https://arxiv.org/abs/2204.08387)
|
57 |
+
Yupan Huang, Tengchao Lv, Lei Cui, Yutong Lu, Furu Wei, Preprint 2022.
|
58 |
+
|
59 |
More information needed
|
60 |
|
61 |
## Intended uses & limitations
|