Sharka commited on
Commit
14f4b3c
1 Parent(s): e93fc55

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +19 -1
README.md CHANGED
@@ -2,4 +2,22 @@
2
  license: mit
3
  language:
4
  - cs
5
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
2
  license: mit
3
  language:
4
  - cs
5
+ ---
6
+
7
+ # LayoutLMv3 Model Fine-tuned with CIVQA (Tesseract) dataset
8
+
9
+ This is a fine-tuned version of the [LayoutLMv3 model](https://huggingface.co/microsoft/layoutlmv3-base), which was trained on Czech Invoice Visual Question Answering (CIVQA) dataset containing invoices in the Czech language as well as on the Data Visualizations via Question Answering ([DVQA] (https://paperswithcode.com/dataset/dvqa)) dataset.
10
+
11
+ This model enables Document Visual Question Answering on Czech invoices with the use of the existing DVQA dataset.
12
+
13
+ Regarding the Czech invoices, we focused on 10 different entities, which are crucial for processing the invoices.
14
+ - Variable symbol
15
+ - Specific symbol
16
+ - Constant symbol
17
+ - Bank code
18
+ - Account number
19
+ - Total amount
20
+ - Invoice date
21
+ - Name of supplier
22
+ - DIC
23
+ - QR code