Commit ff402ac by Tianning • 1 Parent(s): 63f47b2

Update README.md

Files changed (1)
  1. README.md +10 -2
README.md CHANGED
@@ -13,11 +13,19 @@ The key idea is to combine the bounding box modality with text, achieving a pixe
13
 
14
  ![Example Image](https://raw.githubusercontent.com/veya2ztn/Lougat/main/images/image.png)
15
 
16
- The name "Lougat" is a combination of LLama and Nougat. In this repo, you'll also find other combinations like:
17
 
 
 
 
18
  - Florence2 + LLama → Flougat
19
  - Sam2 + LLama → Slougat
20
  - Nougat + Relative Position Embedding LLama → Rlougat
21
 
22
- The key idea is a natural continuation of the paper [LOCR: Location-Guided Transformer for Optical Character Recognition](https://arxiv.org/abs/2403.02127).
 
 
 
 
 
23
 
 
13
 
14
  ![Example Image](https://raw.githubusercontent.com/veya2ztn/Lougat/main/images/image.png)
15
 
16
+ The name "Lougat" is a combination of LLama and Nougat. The key idea is a natural continuation of the paper [LOCR: Location-Guided Transformer for Optical Character Recognition](https://arxiv.org/abs/2403.02127).
17
 
18
+ Current Branch: The **LOCR** model
19
+
20
+ Other Branches:
21
  - Florence2 + LLama → Flougat
22
  - Sam2 + LLama → Slougat
23
  - Nougat + Relative Position Embedding LLama → Rlougat
24
 
25
+
26
+ # Inference and Training
27
+
28
+ Please see `https://github.com/veya2ztn/Lougat`
29
+
30
+
31