Can I load these weights into a model using 8 GPUs?

by bournezz - opened Nov 4, 2022

Nov 4, 2022

I'm new to DeepSpeed and still don't understand how the weight sharding works. I suppose these weights are intended for users with 4 A100 80G GPUs, because there are 4 groups of tp_**.pt files. Since I only have V100s, I need more than 4 GPUs to host these weights. Can I use these weights in my program? If not, how can I reshard them into 8 partitions?

lucadiliello

Mar 21, 2023

You should be fine just by setting training_mp_size=4 in the deepspeed.init_inference call. However, I see that it also works without setting anything with deepspeed>=0.8.0. I suppose that under the hood DeepSpeed splits every tensor a second time to obtain 8 shards.
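To make the supposed "split every tensor another time" step concrete, here is a toy sketch in plain Python. It is not DeepSpeed code, and the function names split_shard and reshard are hypothetical. It assumes a weight that is partitioned along its rows and a new shard count that is a multiple of the old one; real checkpoints partition different layers along different dimensions, which this sketch ignores.

```python
def split_shard(shard, parts):
    """Split one shard (a list of rows) into `parts` equal blocks of rows."""
    if len(shard) % parts != 0:
        raise ValueError("row count must be divisible by the split factor")
    step = len(shard) // parts
    return [shard[i * step:(i + 1) * step] for i in range(parts)]


def reshard(shards, new_degree):
    """Re-split existing shards into `new_degree` shards.

    Assumption: new_degree is a multiple of the current shard count,
    so every old shard is simply cut into equal pieces (e.g. 4 -> 8).
    """
    old_degree = len(shards)
    if new_degree % old_degree != 0:
        raise ValueError("new degree must be a multiple of the old degree")
    factor = new_degree // old_degree
    resharded = []
    for shard in shards:
        # Keep the pieces in order so shard i still covers rows before shard i+1.
        resharded.extend(split_shard(shard, factor))
    return resharded


# Toy weight matrix: 8 rows x 2 columns, row r holds [2r, 2r + 1].
full_weight = [[2 * r, 2 * r + 1] for r in range(8)]

# Pretend the checkpoint was saved with 4 tensor-parallel shards (2 rows each).
four_shards = split_shard(full_weight, 4)

# Cut each shard in half to get one shard per GPU on an 8-GPU machine.
eight_shards = reshard(four_shards, 8)
```

Each of the 8 resulting shards holds one row, and concatenating them in order reproduces the original matrix, which is the property any such re-splitting has to preserve.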

TingchenFu

Jun 19, 2023

Hi @bournezz, did you succeed in running BLOOM inference on V100s? How many V100s did you use?
