|
### Install Git LFS |
|
Before you begin, make sure Git Large File Storage (Git LFS) is installed on your system. Install it using the following command: |
|
|
|
```bash |
|
git lfs install |
|
``` |
|
|
|
### Download the Model from Hugging Face |
|
To download the `PDF-Extract-Kit` model from Hugging Face, use the following command: |
|
|
|
```bash |
|
git lfs clone https://huggingface.co/opendatalab/PDF-Extract-Kit |
|
``` |
|
|
|
Ensure that Git LFS is enabled during the clone to properly download all large files. |
|
|
|
|
|
|
|
### Download the Model from ModelScope |
|
|
|
#### SDK Download |
|
|
|
```bash |
|
# First, install the ModelScope library using pip: |
|
pip install modelscope |
|
``` |
|
|
|
```python |
|
# Use the following Python code to download the model using the ModelScope SDK: |
|
from modelscope import snapshot_download |
|
model_dir = snapshot_download('opendatalab/PDF-Extract-Kit') |
|
``` |
|
|
|
#### Git Download |
|
Alternatively, you can use Git to clone the model repository from ModelScope: |
|
|
|
```bash |
|
git clone https://www.modelscope.cn/opendatalab/PDF-Extract-Kit.git |
|
``` |
|
|
|
|
|
Put [model files]() here: |
|
|
|
``` |
|
./ |
|
βββ Layout |
|
β βββ config.json |
|
β βββ model_final.pth |
|
βββ MFD |
|
β βββ weights.pt |
|
βββ MFR |
|
β βββ UniMERNet |
|
β βββ config.json |
|
β βββ preprocessor_config.json |
|
β βββ pytorch_model.bin |
|
β βββ README.md |
|
β βββ tokenizer_config.json |
|
β βββ tokenizer.json |
|
βββ TabRec |
|
β βββ StructEqTable |
|
β βββ config.json |
|
β βββgeneration_config.json |
|
β βββmodel.safetensors |
|
β βββpreprocessor_config.json |
|
β βββspecial_tokens_map.json |
|
β βββspiece.model |
|
β βββtokenizer_config.json |
|
β βββtokenizer.json |
|
βββ README.md |
|
``` |
|
|