robgreenberg3 commited on
Commit
5ba2601
·
verified ·
1 Parent(s): fa775a9

Update README.md

Browse files

Fixed typo from INT4 to INT8 as this is an INT8 model

Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -12,14 +12,14 @@ base_model: Qwen/Qwen2.5-VL-72B-Instruct
12
  library_name: transformers
13
  ---
14
 
15
- # Qwen2.5-VL-72B-Instruct-quantized-w4a16
16
 
17
  ## Model Overview
18
  - **Model Architecture:** Qwen/Qwen2.5-VL-72B-Instruct
19
  - **Input:** Vision-Text
20
  - **Output:** Text
21
  - **Model Optimizations:**
22
- - **Weight quantization:** INT4
23
  - **Activation quantization:** FP16
24
  - **Release Date:** 2/24/2025
25
  - **Version:** 1.0
 
12
  library_name: transformers
13
  ---
14
 
15
+ # Qwen2.5-VL-72B-Instruct-quantized-w8a8
16
 
17
  ## Model Overview
18
  - **Model Architecture:** Qwen/Qwen2.5-VL-72B-Instruct
19
  - **Input:** Vision-Text
20
  - **Output:** Text
21
  - **Model Optimizations:**
22
+ - **Weight quantization:** INT8
23
  - **Activation quantization:** FP16
24
  - **Release Date:** 2/24/2025
25
  - **Version:** 1.0