Update README.md
README.md
CHANGED
@@ -13,11 +13,19 @@ The key idea is to combine the bounding box modality with text, achieving a pixe
 
 ![Example Image](https://raw.githubusercontent.com/veya2ztn/Lougat/main/images/image.png)
 
-The name "Lougat" is a combination of LLama and Nougat.
+The name "Lougat" is a combination of LLama and Nougat. The key idea is a natural continuation of the paper [LOCR: Location-Guided Transformer for Optical Character Recognition](https://arxiv.org/abs/2403.02127).
 
+Current Branch: The **LOCR** model
+
+Other Branches:
 - Florence2 + LLama → Flougat
 - Sam2 + LLama → Slougat
 - Nougat + Relative Position Embedding LLama → Rlougat
 
-
+
+# Inference and Training
+
+Please see `https://github.com/veya2ztn/Lougat`
+
+
 