Qwen-Image-ControlNet-Inpainting



Utilize HF's "balanced" device_map + dynamically pair diffusion components to relevant execution cores

by diopside - opened 2 days ago

base: refs/heads/main ← from: refs/pr/1


+34 -7

diopside - 2 days ago (edited 2 days ago)

By using balanced mode and explicitly pairing diffusion components on grouped GPUs, we avoid OOM errors and can run on 4x L40S GPUs.
Example distribution: text encoder on GPU 1 (16.6GB); everything else on GPU 2 (44.5GB), including ControlNet (4.23GB), VAE (254MB), and Transformer (40GB).
This keeps overall memory usage split efficiently across the GPUs while ensuring that all components that interact directly are on the same device.
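
For reviewers, here is a minimal sketch of the grouping idea, not the code in this PR. It treats components that must interact directly as one group and places each group on a single device. The function name `plan_placement`, the group layout, and the 48GB per-GPU capacity are assumptions for illustration; only the component sizes come from the description above. It does not model activation memory or headroom.

```python
from typing import Dict, List, Sequence

# Component sizes in GB, taken from the PR description.
SIZES_GB: Dict[str, float] = {
    "text_encoder": 16.6,
    "controlnet": 4.23,
    "vae": 0.254,
    "transformer": 40.0,
}

# Hypothetical grouping: components in the same inner list must share a device.
GROUPS: List[List[str]] = [
    ["text_encoder"],
    ["transformer", "controlnet", "vae"],
]


def plan_placement(
    sizes_gb: Dict[str, float],
    groups: Sequence[Sequence[str]],
    capacity_gb: Sequence[float],
) -> Dict[str, str]:
    """Place each group, in the order given, on the GPU with the most free memory."""
    free = list(capacity_gb)
    placement: Dict[str, str] = {}
    for group in groups:
        need = sum(sizes_gb[name] for name in group)
        # Ties resolve to the lowest GPU index.
        gpu = max(range(len(free)), key=lambda i: free[i])
        if free[gpu] < need:
            raise MemoryError(
                f"group {list(group)} needs {need:.2f}GB, "
                f"largest free slot is {free[gpu]:.2f}GB"
            )
        free[gpu] -= need
        for name in group:
            placement[name] = f"cuda:{gpu}"
    return placement


if __name__ == "__main__":
    # Assumed capacity: two GPUs with 48GB each (not stated in the PR).
    for name, device in plan_placement(SIZES_GB, GROUPS, [48.0, 48.0]).items():
        print(f"{name}: {device}")
```

With these assumed capacities the text encoder lands on `cuda:0` and the transformer, ControlNet, and VAE land together on `cuda:1`, which mirrors the GPU 1 / GPU 2 split described above.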

Commit 48fd1f54: Utilize HF's "balanced" device_map + dynamically pair diffusion components to relevant execution cores

diopside - 2 days ago

@instantx-admin @linoyts shabbat shalom, please review 😄


Ready to merge

This branch is ready to get merged automatically.
