Image-to-Image
Diffusers
English

what's the difference with V1?

#8
by flankechen - opened

https://huggingface.co/XLabs-AI/flux-ip-adapter

any more tech detail or report?

XLabs AI org

So, 500k steps vs 75k, and 13x larger dataset, 16 visual tokens instead of 4 in v1

So, 500k steps vs 75k, and 13x larger dataset, 16 visual tokens instead of 4 in v1

thanks, is the clip image projector still linear+layernorm as the original ipadapter paper base model?
would you try to train with plus like, resampler model?

XLabs AI org

no, only default version

Sign up or log in to comment