Add pipeline and OCR tags, update paper link
This PR improves the model card by:
- Adding `pipeline_tag: image-to-text` to the metadata so the model is discoverable under the correct task on the Hugging Face Hub.
- Adding an `ocr` entry under `tags` to improve discoverability for OCR-specific use cases.
- Updating the paper link in the "Quick links" section to point to the official Hugging Face paper page for [olmOCR 2: Unit Test Rewards for Document OCR](https://huggingface.co/papers/2510.19817).
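
As a rough way to sanity-check the metadata change, the sketch below parses the front matter this PR produces and confirms the new keys are present. The `read_card_metadata` helper is hypothetical and handles only the flat keys and simple lists that appear in this card; it is not a general YAML parser, and the Hub's own parsing may differ.

```python
# Front matter as it appears on the new side of the diff.
CARD_FRONT_MATTER = """---
base_model:
- Qwen/Qwen2.5-VL-7B-Instruct
language:
- en
library_name: transformers
license: apache-2.0
pipeline_tag: image-to-text
tags:
- ocr
---
"""


def read_card_metadata(readme_text):
    """Parse a flat '---'-delimited header into a dict (hypothetical helper).

    Supports only `key: value` scalars and `key:` followed by `- item` lines.
    """
    lines = readme_text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}

    metadata = {}
    current_list_key = None
    for line in lines[1:]:
        stripped = line.strip()
        if stripped == "---":  # closing delimiter of the header
            break
        if stripped.startswith("- ") and current_list_key is not None:
            metadata[current_list_key].append(stripped[2:].strip())
            continue
        key, sep, value = stripped.partition(":")
        if not sep:
            continue  # not a key line; ignore
        key, value = key.strip(), value.strip()
        if value:
            metadata[key] = value
            current_list_key = None
        else:
            metadata[key] = []  # list items follow on the next lines
            current_list_key = key
    return metadata


metadata = read_card_metadata(CARD_FRONT_MATTER)
print(metadata["pipeline_tag"], metadata["tags"])
```

A hand-rolled parser keeps the example dependency-free; for real cards a proper YAML library is the safer choice.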
README.md CHANGED
@@ -1,10 +1,13 @@
 ---
-license: apache-2.0
-language:
-- en
 base_model:
 - Qwen/Qwen2.5-VL-7B-Instruct
+language:
+- en
 library_name: transformers
+license: apache-2.0
+pipeline_tag: image-to-text
+tags:
+- ocr
 ---
 
 <img alt="olmOCR Logo" src="https://cdn-uploads.huggingface.co/production/uploads/6734d6722769638944a5aa2e/DPsr3ZvRF9v-gdMa4EaHW.png" width="300px" style="margin-left:'auto' margin-right:'auto' display:'block'">
@@ -18,7 +21,7 @@ This is a release of the olmOCR model that's fine tuned from Qwen2.5-VL-7B-Instr
 fine tuned using GRPO RL training to boost its performance at math equations, tables, and other tricky OCR cases.
 
 Quick links:
-- [Paper](https://
+- [Paper](https://huggingface.co/papers/2510.19817)
 - 🤗 [SFT Dataset](https://huggingface.co/datasets/allenai/olmOCR-mix-1025)
 - 🤗 [RL Dataset](https://huggingface.co/datasets/allenai/olmOCR-synthmix-1025)
 - 🛠️ [Code](https://github.com/allenai/olmocr)
@@ -165,7 +168,16 @@ text_output = processor.tokenizer.batch_decode(
 )
 
 print(text_output)
-# ['
+# ['---
+primary_language: en
+is_rotation_valid: True
+rotation_correction: 0
+is_table: False
+is_diagram: False
+---
+olmOCR: Unlocking Trillions of Tokens in PDFs with Vision Language Models
+
+Jake Poz']
 ```
 
 ## License and use
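
The sample output added in the last hunk begins with a `---`-delimited header followed by the page text. A minimal sketch of separating the two is below; `split_page_output` and `_coerce` are hypothetical helpers (not part of olmOCR), and the header is assumed to contain only the flat `key: value` pairs shown in the sample.

```python
# Sample taken from the diff; the body is truncated there ("Jake Poz") and is kept as-is.
SAMPLE_OUTPUT = (
    "---\n"
    "primary_language: en\n"
    "is_rotation_valid: True\n"
    "rotation_correction: 0\n"
    "is_table: False\n"
    "is_diagram: False\n"
    "---\n"
    "olmOCR: Unlocking Trillions of Tokens in PDFs with Vision Language Models\n"
    "\n"
    "Jake Poz"
)


def _coerce(value):
    """Turn 'True'/'False' into bools and digit strings into ints; keep the rest."""
    if value in ("True", "False"):
        return value == "True"
    try:
        return int(value)
    except ValueError:
        return value


def split_page_output(page_text):
    """Return (header_dict, body_text) for one decoded page (hypothetical helper)."""
    lines = page_text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}, page_text  # no header present

    # Find the closing '---'; if it is missing, treat everything as body.
    end = None
    for index in range(1, len(lines)):
        if lines[index].strip() == "---":
            end = index
            break
    if end is None:
        return {}, page_text

    header = {}
    for line in lines[1:end]:
        key, sep, value = line.partition(":")
        if sep:
            header[key.strip()] = _coerce(value.strip())

    body = "\n".join(lines[end + 1:])
    return header, body


header, body = split_page_output(SAMPLE_OUTPUT)
print(header)
print(body)
```

Only lines between the two delimiters are treated as header fields, so a colon in the body (as in the title line of the sample) is left untouched.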