CalamitousFelicitousness committed
Commit 4cf63ac
1 Parent(s): 2014bda
Update README.md
README.md CHANGED
@@ -10,7 +10,7 @@ tags:
 base_model: Qwen/Qwen2-VL-72B-Instruct
 ---
 
-# This repo contains a fix for intermediate_size which was incompatible with VLLM parallel inference.
+# This repo contains a fix for intermediate_size which was incompatible with VLLM parallel inference. This repo will allow you to run with tensor_parallel of 2.
 
 # Qwen2-VL-72B-Instruct-GPTQ-Int4
 
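Note on the change above: the README line ties the `intermediate_size` fix to running with a tensor-parallel degree of 2. A minimal sketch of the kind of divisibility check that is likely involved follows. It assumes a GPTQ group size of 128 and example sizes of 29568 (unpadded) and 29696 (padded); none of these numbers appear in this commit, so verify them against the repo's `config.json` and quantization config. The helper name `shards_cleanly` is hypothetical and is not part of vLLM.

```python
def shards_cleanly(intermediate_size: int, tensor_parallel: int, group_size: int) -> bool:
    """Return True if each tensor-parallel rank gets a whole number of quantization groups.

    Hypothetical helper for illustration only; it mirrors the general idea that a
    group-quantized weight can only be split where group boundaries fall.
    """
    # The dimension must first split evenly across ranks.
    if intermediate_size % tensor_parallel != 0:
        return False
    per_rank = intermediate_size // tensor_parallel
    # Each rank's slice must then contain only complete quantization groups.
    return per_rank % group_size == 0


# Assumed values, not taken from this commit.
GROUP_SIZE = 128
UNPADDED = 29568
PADDED = 29696

for size in (UNPADDED, PADDED):
    for tp in (1, 2):
        print(f"intermediate_size={size} tp={tp} ok={shards_cleanly(size, tp, GROUP_SIZE)}")
```

Under these assumed values the unpadded size passes the check for a single GPU but fails once the dimension is halved, while the padded size passes for both. That is consistent with the README's claim, but it is an illustration and not a statement of what this repo actually changed.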