Seeking feedback
Wanted to get some feedback from your end on this Space. It helps determine the memory requirements of a `DiffusionPipeline`. I believe this is practically quite useful: users can easily gauge a ballpark figure for a pipeline's memory requirements and plan resource allocation accordingly.
If you could provide some feedback on the Space or maybe even directly open PRs with your improvement patches, I would be grateful!
Hi @sayakpaul,
Looks great!
There seems to be a small bug (maybe it's just expected), but other than that, I don't have any suggestions for improvement.
Just fixed. Thank you for spotting!
Very cool, I love it!
- I'd include the total considering all components loaded.
- Maybe repeat this sentence in the results area too: "Generation typically requires an additional 20% to these numbers, as found by EleutherAI". I know it's in the description, but people will focus on the results. Or even add a new line item with this. This will depend on the resolution, if this gets traction we could consider trying to compute a better estimate.
- I noticed that memory is slightly different for `bin` vs `safetensors`, why would that be the case? (I'd maybe remove this option to simplify.)
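The suggested "total plus generation overhead" line item is simple arithmetic; here is a minimal sketch (the function name and component figures are hypothetical, and the 20% figure is the rule of thumb quoted in the description):

```python
def estimate_inference_memory_bytes(component_bytes, overhead=0.20):
    # Sum the per-component checkpoint sizes, then add the rule-of-thumb
    # ~20% generation overhead mentioned in the Space's description.
    total = sum(component_bytes.values())
    return total * (1 + overhead)

# Toy per-component sizes in bytes (made up for illustration).
components = {"unet": 100, "vae": 50}
estimate = estimate_inference_memory_bytes(components)
```

As noted, the real overhead depends on resolution, so this is only a coarse estimate.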
> I noticed that memory is slightly different for `bin` vs `safetensors`, why would that be the case? (I'd maybe remove this option to simplify.)

Ccing @Wauplin for this.
> This will depend on the resolution, if this gets traction we could consider trying to compute a better estimate.
Good point. Will add a note about the resolution.
How is the memory computed?
Not an expert here but can it be due to shared tensors that are not handled the same between the .bin and .safetensors versions?
> How is the memory computed?

- Retrieve the files with metadata.
- Use the `size` attribute.
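The two steps above might look roughly like this; a minimal sketch assuming the sizes come from the `siblings` list that `huggingface_hub.model_info(repo_id, files_metadata=True)` returns (the helper name and the toy file listing are made up for illustration):

```python
def checkpoint_size_bytes(files, extension=".safetensors"):
    # `files` holds (filename, size-in-bytes) pairs, e.g. built from the
    # `rfilename` and `size` attributes of each sibling in the repo's file
    # metadata. Sum only the weight files in the requested format.
    return sum(size for name, size in files if name.endswith(extension))

# Toy file listing with made-up sizes.
files = [
    ("unet/model.safetensors", 100),
    ("unet/model.bin", 90),
    ("vae/model.safetensors", 50),
]
```

Filtering by extension keeps `bin` and `safetensors` totals separate, which is also where the small discrepancy between the two options shows up.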
> Not an expert here but can it be due to shared tensors that are not handled the same between the `.bin` and `.safetensors` versions?
So, it seems like this happens at the serialization step and not at the size computation step per se.
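If it helps, here is a toy illustration of why serialization choices around shared tensors can shift the totals: counting each tensor's bytes per entry vs counting each underlying storage once gives different sums. The data structure is purely hypothetical and not the actual serialization logic of either format:

```python
def naive_size(tensors):
    # Count every entry's bytes, even when two entries share storage
    # (roughly a per-tensor sum over a raw state dict).
    return sum(nbytes for _, nbytes in tensors.values())

def dedup_size(tensors):
    # Count each underlying storage only once, as a serializer that
    # deduplicates shared tensors would.
    seen = {}
    for storage_id, nbytes in tensors.values():
        seen[storage_id] = nbytes
    return sum(seen.values())

# Toy "state dict": name -> (storage id, size in bytes); the embedding
# and LM head share one storage, as with tied weights.
tied = {
    "embed.weight": ("s0", 1000),
    "lm_head.weight": ("s0", 1000),
    "proj.weight": ("s1", 400),
}
```

Whichever convention a format follows at save time determines the on-disk size, which matches the observation that the difference appears at the serialization step rather than in the size computation itself.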