felipebandeira committed on
Commit f637104
1 Parent(s): 8a6267d
Update README.md
README.md
CHANGED
@@ -9,5 +9,16 @@ metrics:
 pipeline_tag: image-to-text
 ---
 
-This model extracts information from EU driver's licenses and returns it as JSON.
-
+This model extracts information from EU driver's licenses and returns it as JSON. For optimal performance, we recommend that input images:
+- have a size of 1192x772
+- have high resolution and do not contain light reflection effects
+
+Accuracy
+- on validation set: 98%
+- on set of real licenses: 63.93%
+
+Article describing model:
+https://medium.com/@ofelipebandeira/transformers-vs-ocr-who-can-read-better-192e6b044dd3
+
+Article describing synthetic dataset used in training:
+https://python.plainenglish.io/how-to-create-synthetic-datasets-of-document-images-5f140dee5e40
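
The README change recommends 1192x772 inputs but does not say how to get an arbitrary photo to that size. One possible approach, sketched below, is to scale the image to fit inside 1192x772 while keeping its aspect ratio, then pad the remainder. This is an assumption, not the author's documented preprocessing; the function name `letterbox_geometry` and the choice to center the image are hypothetical. The sketch only computes the geometry, so it needs no imaging library.

```python
# Recommended input size taken from the README text (width x height).
TARGET_W, TARGET_H = 1192, 772


def letterbox_geometry(width, height, target_w=TARGET_W, target_h=TARGET_H):
    """Return (new_w, new_h, pad_left, pad_top) for fitting an image of
    size width x height inside target_w x target_h without distortion.

    Hypothetical helper: the README does not specify whether images
    should be padded, cropped, or stretched.
    """
    if width <= 0 or height <= 0:
        raise ValueError("image dimensions must be positive")

    # Use the smaller scale factor so both sides fit inside the target.
    scale = min(target_w / width, target_h / height)
    new_w = min(target_w, max(1, round(width * scale)))
    new_h = min(target_h, max(1, round(height * scale)))

    # Center the scaled image; the leftover space becomes padding.
    pad_left = (target_w - new_w) // 2
    pad_top = (target_h - new_h) // 2
    return new_w, new_h, pad_left, pad_top


if __name__ == "__main__":
    # A square 1000x1000 photo is height-limited, so it is padded on the sides.
    print(letterbox_geometry(1000, 1000))
```

The returned values can be passed to whatever imaging library is in use: resize to `(new_w, new_h)`, then paste onto a 1192x772 canvas at `(pad_left, pad_top)`. Whether padding or plain stretching works better for this model is not stated in the README and would need to be checked against real inputs.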