No cfg?

by wang0422 - opened 4 days ago

https://github.com/Tencent/HunyuanVideo/blob/main/hyvideo/diffusion/pipelines/pipeline_hunyuan_video.py#L1012
The official code here uses classifier_free_guidance.

But this part of the code is not in the diffusers code: https://github.com/huggingface/diffusers/blob/main/src/diffusers/pipelines/hunyuan_video/pipeline_hunyuan_video.py#L651

I don't know if I understand this correctly.

My question: how much of a difference does it make whether or not CFG is used in the DiT?

a-r-r-o-w

HunyuanVideo Community org 4 days ago

The released HunyuanVideo model is guidance-distilled, so it does not make sense to use it with true CFG here. While it may still work, it leads to an unnecessary 2x increase in compute and memory requirements, which is why we did not implement the guidance-related parts. Even in the official code examples, they only set embedded CFG values and not real CFG, so there is no interpolation between unconditional and conditional latents taking place.
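
To make the difference concrete, here is a minimal toy sketch contrasting the two approaches. Everything in it is an assumption for illustration: `ToyDenoiser`, `true_cfg_step`, and `embedded_guidance_step` are hypothetical names, and the arithmetic inside the denoiser is a stand-in, not the HunyuanVideo transformer or the diffusers API. It only shows where the extra forward pass comes from.

```python
class ToyDenoiser:
    """Hypothetical stand-in for a denoising model; counts forward passes."""

    def __init__(self):
        self.calls = 0

    def __call__(self, latent, cond, guidance=None):
        self.calls += 1
        # Toy behaviour: a guidance-distilled model takes the guidance scale
        # as an extra input instead of needing an unconditional pass.
        gain = 1.0 if guidance is None else guidance
        return [x + gain * cond for x in latent]


def true_cfg_step(model, latent, cond, uncond, guidance_scale):
    """True CFG: two forward passes, then combine the two predictions."""
    noise_cond = model(latent, cond)
    noise_uncond = model(latent, uncond)
    return [u + guidance_scale * (c - u) for c, u in zip(noise_cond, noise_uncond)]


def embedded_guidance_step(model, latent, cond, guidance_scale):
    """Embedded guidance: one forward pass, scale passed in as conditioning."""
    return model(latent, cond, guidance=guidance_scale)


latent = [0.0, 1.0]

cfg_model = ToyDenoiser()
cfg_out = true_cfg_step(cfg_model, latent, cond=1.0, uncond=0.0, guidance_scale=6.0)

distilled_model = ToyDenoiser()
distilled_out = embedded_guidance_step(distilled_model, latent, cond=1.0, guidance_scale=6.0)

print(cfg_model.calls, distilled_model.calls)  # forward passes per step
```

In this toy setup the distilled path reaches the same output with one model call per step instead of two, which is the cost argument above. A real distilled model only approximates the guided output, so treat the exact match here as an artifact of the toy arithmetic.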

a-r-r-o-w changed discussion status to closed 4 days ago
