Cannot run it on V100
#2 opened by CrazyAIGC
You also need to put your model on the GPU, i.e. `model.to("cuda")`.
You may not be able to fit it on a 16 GB V100, though.
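Concretely, that step looks like this (a minimal sketch, using one of the smaller checkpoints as an example):

```python
import torch
from transformers import Blip2ForConditionalGeneration

# example with one of the smaller BLIP-2 checkpoints; swap in the variant you downloaded
model = Blip2ForConditionalGeneration.from_pretrained(
    "Salesforce/blip2-opt-2.7b", torch_dtype=torch.float16
)
model.to("cuda")  # the weights themselves must be on the GPU, not just the inputs
```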
This model requires about 30 GB of GPU RAM if you use 8-bit inference (i.e. pass `load_in_8bit=True` to `from_pretrained`).
So I'd recommend checking out the smaller BLIP-2 variants.
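For reference, 8-bit loading looks roughly like this (a sketch, assuming the blip2-flan-t5-xxl checkpoint this thread seems to be about; it needs bitsandbytes installed, and `device_map="auto"` from accelerate to place the weights):

```python
from transformers import Blip2ForConditionalGeneration

checkpoint = "Salesforce/blip2-flan-t5-xxl"  # assumed to be the xxl checkpoint in question
# load_in_8bit requires the bitsandbytes package; device_map="auto" (via accelerate)
# dispatches the quantized weights onto the available GPU(s)
model = Blip2ForConditionalGeneration.from_pretrained(
    checkpoint,
    load_in_8bit=True,
    device_map="auto",
)
```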
I tried 8-bit inference on my 32 GB V100 but it failed.
I've already converted the input to fp16, but this bug still occurred: `AssertionError: The input data type needs to be fp16 but torch.float32 was found!`
Has anyone successfully run the xxl model on the V100?
@zhouqh I've run the opt_6.7b variant on a 24 GB A10 with fp16.
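Roughly what that looks like end to end (a sketch, assuming the Salesforce/blip2-opt-6.7b checkpoint and the standard COCO demo image; the point is that both the weights and the processed pixel values end up in fp16):

```python
import requests
import torch
from PIL import Image
from transformers import Blip2Processor, Blip2ForConditionalGeneration

checkpoint = "Salesforce/blip2-opt-6.7b"
processor = Blip2Processor.from_pretrained(checkpoint)
model = Blip2ForConditionalGeneration.from_pretrained(
    checkpoint, torch_dtype=torch.float16
).to("cuda")

url = "http://images.cocodataset.org/val2017/000000039769.jpg"
image = Image.open(requests.get(url, stream=True).raw)

# cast the pixel values to fp16 so they match the model weights
inputs = processor(images=image, return_tensors="pt").to("cuda", torch.float16)
generated_ids = model.generate(**inputs)
print(processor.batch_decode(generated_ids, skip_special_tokens=True)[0].strip())
```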