CalamitousFelicitousness/Qwen2-VL-72B-Instruct-GPTQ-Int4-tpfix

Image-Text-to-Text

4-bit precision


CalamitousFelicitousness committed on Sep 22

Commit 4cf63ac (1 parent: 2014bda)

Update README.md

Files changed (1)

README.md +1 -1

README.md CHANGED

@@ -10,7 +10,7 @@ tags:
 base_model: Qwen/Qwen2-VL-72B-Instruct
 ---
-# This repo contains a fix for intermediate_size which was incompatible with VLLM parallel inference.
+# This repo contains a fix for intermediate_size which was incompatible with VLLM parallel inference. This repo will allow you to run with tensor_parallel of 2.
 # Qwen2-VL-72B-Instruct-GPTQ-Int4
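
The README line above says the original `intermediate_size` was incompatible with tensor-parallel inference. The commit does not say why, so the following is only a minimal sketch of one plausible constraint: that each tensor-parallel shard of the MLP dimension must hold a whole number of quantization groups. The helper names, the group size of 128, and the example value 29568 are assumptions for illustration and are not taken from this repo.

```python
# Hypothetical helpers; not part of vLLM or of this repository.

def shards_evenly(intermediate_size: int, tensor_parallel: int,
                  group_size: int = 128) -> bool:
    """Return True if every shard holds a whole number of quantization groups.

    Assumed constraint: the dimension is split evenly across ranks, and each
    slice must itself be a multiple of the (assumed) group size.
    """
    if intermediate_size % tensor_parallel != 0:
        return False
    per_shard = intermediate_size // tensor_parallel
    return per_shard % group_size == 0


def pad_to_shardable(intermediate_size: int, tensor_parallel: int,
                     group_size: int = 128) -> int:
    """Round intermediate_size up to the next multiple of tensor_parallel * group_size."""
    multiple = tensor_parallel * group_size
    # Ceiling division using integer arithmetic only.
    return -(-intermediate_size // multiple) * multiple


if __name__ == "__main__":
    size = 29568  # illustrative value only (assumption)
    print(shards_evenly(size, 2))
    padded = pad_to_shardable(size, 2)
    print(padded, shards_evenly(padded, 2))
```

Under these assumptions, a size that fails the check at `tensor_parallel=2` passes once it is padded, which matches the kind of fix the README describes. The actual value used in this repo should be read from its `config.json`.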