Add pipeline and OCR tags, update paper link

#2
by nielsr HF Staff - opened
Files changed (1) hide show
  1. README.md +17 -5
README.md CHANGED
@@ -1,10 +1,13 @@
1
  ---
2
- license: apache-2.0
3
- language:
4
- - en
5
  base_model:
6
  - Qwen/Qwen2.5-VL-7B-Instruct
 
 
7
  library_name: transformers
 
 
 
 
8
  ---
9
 
10
  <img alt="olmOCR Logo" src="https://cdn-uploads.huggingface.co/production/uploads/6734d6722769638944a5aa2e/DPsr3ZvRF9v-gdMa4EaHW.png" width="300px" style="margin-left:'auto' margin-right:'auto' display:'block'">
@@ -18,7 +21,7 @@ This is a release of the olmOCR model that's fine tuned from Qwen2.5-VL-7B-Instr
18
  fine tuned using GRPO RL training to boost its performance at math equations, tables, and other tricky OCR cases.
19
 
20
  Quick links:
21
- - πŸ“ƒ [Paper](https://olmocr.allenai.org/papers/olmocr.pdf)
22
  - πŸ€— [SFT Dataset](https://huggingface.co/datasets/allenai/olmOCR-mix-1025)
23
  - πŸ€— [RL Dataset](https://huggingface.co/datasets/allenai/olmOCR-synthmix-1025)
24
  - πŸ› οΈ [Code](https://github.com/allenai/olmocr)
@@ -165,7 +168,16 @@ text_output = processor.tokenizer.batch_decode(
165
  )
166
 
167
  print(text_output)
168
- # ['---\nprimary_language: en\nis_rotation_valid: True\nrotation_correction: 0\nis_table: False\nis_diagram: False\n---\nolmOCR: Unlocking Trillions of Tokens in PDFs with Vision Language Models\n\nJake Poz']
 
 
 
 
 
 
 
 
 
169
  ```
170
 
171
  ## License and use
 
1
  ---
 
 
 
2
  base_model:
3
  - Qwen/Qwen2.5-VL-7B-Instruct
4
+ language:
5
+ - en
6
  library_name: transformers
7
+ license: apache-2.0
8
+ pipeline_tag: image-to-text
9
+ tags:
10
+ - ocr
11
  ---
12
 
13
  <img alt="olmOCR Logo" src="https://cdn-uploads.huggingface.co/production/uploads/6734d6722769638944a5aa2e/DPsr3ZvRF9v-gdMa4EaHW.png" width="300px" style="margin-left:'auto' margin-right:'auto' display:'block'">
 
21
  fine tuned using GRPO RL training to boost its performance at math equations, tables, and other tricky OCR cases.
22
 
23
  Quick links:
24
+ - πŸ“ƒ [Paper](https://huggingface.co/papers/2510.19817)
25
  - πŸ€— [SFT Dataset](https://huggingface.co/datasets/allenai/olmOCR-mix-1025)
26
  - πŸ€— [RL Dataset](https://huggingface.co/datasets/allenai/olmOCR-synthmix-1025)
27
  - πŸ› οΈ [Code](https://github.com/allenai/olmocr)
 
168
  )
169
 
170
  print(text_output)
171
+ # ['---
172
+ primary_language: en
173
+ is_rotation_valid: True
174
+ rotation_correction: 0
175
+ is_table: False
176
+ is_diagram: False
177
+ ---
178
+ olmOCR: Unlocking Trillions of Tokens in PDFs with Vision Language Models
179
+
180
+ Jake Poz']
181
  ```
182
 
183
  ## License and use