I use vllm==0.6.3 load this model,it generate the fowllowing error

#1
by wc-llm - opened

when I use vllm==0.6.3 load this model,it generate the fowllowing error
File "/usr/local/lib/python3.10/dist-packages/vllm/model_executor/parameter.py", line 133, in load_qkv_weight
assert param_data.shape == loaded_weight.shape
AssertionError

torch.Size([6144, 768]) != torch.Size([4608, 768])

Sign up or log in to comment