Commit by WebashalarForML: Update README2.md
README2.md CHANGED (+7 -9)
@@ -3,10 +3,9 @@ _\\-------- **Image Data Extractor** -------\\_
 _\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\_
 
 ---
-
 # Overview:
 The **Image Data Extractor** is a Python-based tool designed to extract and structure text data from images of visiting cards using **PaddleOCR**. The tool processes the extracted text to recognize key information such as name, designation, contact number, address, and company name, organizing the output into a well-defined structure. The **Mistral 7B model** is used for advanced text analysis, and if it becomes unavailable, the system seamlessly switches to the **Gliner urchade/gliner_mediumv2.1** model.
-
+---
 # Installation Guide:
 
 1. **Create and Activate a Virtual Environment**
@@ -37,7 +36,7 @@ The **Image Data Extractor** is a Python-based tool designed to extract and stru
 ```bash
 HF_TOKEN=<your_huggingface_token>
 ```
-
+---
 # File Structure Overview:
 
 ```
@@ -71,7 +70,7 @@ ImageDataExtractor/
 │
 └── .env # Environment variables (includes Hugging Face token)
 ```
-
+---
 # Program Overview:
 
 ### PaddleOCR Integration (utility/utils.py):
@@ -88,7 +87,7 @@ ImageDataExtractor/
 ### Web Interface (app.py):
 - **Flask API**: Provides endpoints for image uploads and displays the results in a structured manner.
 - **HTML Interface**: A frontend for users to upload images of visiting cards and view the parsed results.
-
+---
 # Tree Map of the Program:
 
 ```
@@ -108,11 +107,11 @@ Backup/backup.py
 └── Backup and error handling
 
 ```
-
+---
 # Licensing:
 - **Mistral 7B model** is used under the [Apache 2.0 license](https://www.apache.org/licenses/LICENSE-2.0).
 - **Gliner urchade/gliner_mediumv2.1 model** is used under the [Apache 2.0 license](https://www.apache.org/licenses/LICENSE-2.0).
-
+---
 # Main Task:
 The main objective is to extract and structure text data from visiting cards. The system identifies and organizes:
 - **Name**
@@ -120,7 +119,7 @@ The main objective is to extract and structure text data from visiting cards. Th
 - **Phone Number**
 - **Address**
 - **Company Name**
-
+---
 # References:
 
 - [PaddleOCR Documentation](https://github.com/PaddlePaddle/PaddleOCR)
@@ -129,5 +128,4 @@ The main objective is to extract and structure text data from visiting cards. Th
 - [Flask Documentation](https://flask.palletsprojects.com/)
 - [Docker Documentation](https://docs.docker.com/)
 - [Virtual Environments in Python](https://docs.python.org/3/tutorial/venv.html)
-
 ---
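
The Overview in this README says that Mistral 7B handles the text analysis and that the system switches to the GLiNER model when Mistral is unavailable. The diff does not show how the repository implements that switch, so the following is only a minimal sketch of the general fallback pattern. It assumes each backend is wrapped in a plain function. The names `extract_with_fallback`, `ExtractorUnavailable`, `mistral_stub`, and `gliner_stub` are hypothetical, and the two stubs stand in for the real model calls.

```python
import re
from typing import Callable, Dict, List, Optional, Sequence

# Hypothetical types for illustration; not taken from the repository.
Fields = Dict[str, List[str]]
Extractor = Callable[[str], Fields]


class ExtractorUnavailable(Exception):
    """Raised by a backend that cannot be reached or loaded."""


def extract_with_fallback(text: str, extractors: Sequence[Extractor]) -> Fields:
    """Try each extractor in order and return the first successful result."""
    last_error: Optional[Exception] = None
    for extractor in extractors:
        try:
            return extractor(text)
        except ExtractorUnavailable as err:
            # Remember why this backend failed, then try the next one.
            last_error = err
    raise RuntimeError("no extractor was available") from last_error


def mistral_stub(text: str) -> Fields:
    """Stand-in for the primary model; simulates an outage."""
    raise ExtractorUnavailable("primary model endpoint not reachable")


def gliner_stub(text: str) -> Fields:
    """Stand-in for the backup model; only picks out phone-like strings."""
    phones = re.findall(r"\+?\d[\d\s-]{7,}\d", text)
    return {"phone_number": [p.strip() for p in phones]}


# OCR text from a visiting card (made-up sample data).
result = extract_with_fallback(
    "Jane Doe, CTO, +91 98765 43210",
    [mistral_stub, gliner_stub],
)
print(result)
```

Keeping the backends in an ordered list means further fallbacks could be added without touching the control flow, and chaining the last error into the final `RuntimeError` preserves the reason the preferred model was skipped.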