Spaces:
Running
Running
File size: 7,055 Bytes
681fa96 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 |
# ComfyUI InstantID (Native Support)
## Translations
- [简体中文 (Simplified Chinese)](./README.zh-CN.md)
Native [InstantID](https://github.com/InstantID/InstantID) support for [ComfyUI](https://github.com/comfyanonymous/ComfyUI).
This extension differs from the many already available as it doesn't use *diffusers* but instead implements InstantID natively and it fully integrates with ComfyUI.
# Sponsorship
<div align="center">
**[:heart: Github Sponsor](https://github.com/sponsors/cubiq) | [:coin: Paypal](https://paypal.me/matt3o)**
</div>
If you like my work and wish to see updates and new features please consider sponsoring my projects.
- [ComfyUI IPAdapter Plus](https://github.com/cubiq/ComfyUI_IPAdapter_plus)
- [ComfyUI InstantID (Native)](https://github.com/cubiq/ComfyUI_InstantID)
- [ComfyUI Essentials](https://github.com/cubiq/ComfyUI_essentials)
- [ComfyUI FaceAnalysis](https://github.com/cubiq/ComfyUI_FaceAnalysis)
Not to mention the documentation and videos tutorials. Check my **ComfyUI Advanced Understanding** videos on YouTube for example, [part 1](https://www.youtube.com/watch?v=_C7kR2TFIX0) and [part 2](https://www.youtube.com/watch?v=ijqXnW_9gzc)
The only way to keep the code open and free is by sponsoring its development. The more sponsorships the more time I can dedicate to my open source projects.
Please consider a [Github Sponsorship](https://github.com/sponsors/cubiq) or [PayPal donation](https://paypal.me/matt3o) (Matteo "matt3o" Spinelli). For sponsorships of $50+, let me know if you'd like to be mentioned in this readme file, you can find me on [Discord](https://latent.vision/discord) or _matt3o :snail: gmail.com_.
## Important updates
- **2024/02/27:** Added [noise injection](#noise-injection) in the negative embeds.
- **2024/02/26:** Fixed a small but nasty bug. Results will be different and you may need to lower the CFG.
- **2024/02/20:** I refactored the nodes so they are hopefully easier to use. **This is a breaking update**, the previous workflows won't work anymore.
## Basic Workflow
In the `examples` directory you'll find some basic workflows.
![workflow](examples/instantid_basic_workflow.jpg)
## Video Tutorial
<a href="https://youtu.be/wMLiGhogOPE" target="_blank">
<img src="https://img.youtube.com/vi/wMLiGhogOPE/hqdefault.jpg" alt="Watch the video" />
</a>
** :movie_camera: [Introduction to InstantID features](https://youtu.be/wMLiGhogOPE)**
## Installation
**Upgrade ComfyUI to the latest version!**
Download or `git clone` this repository into the `ComfyUI/custom_nodes/` directory or use the Manager.
InstantID requires `insightface`, you need to add it to your libraries together with `onnxruntime` and `onnxruntime-gpu`.
The InsightFace model is **antelopev2** (not the classic buffalo_l). Download the models (for example from [here](https://drive.google.com/file/d/18wEUfMNohBJ4K3Ly5wpTejPfDzp-8fI8/view?usp=sharing) or [here](https://huggingface.co/MonsterMMORPG/tools/tree/main)), unzip and place them in the `ComfyUI/models/insightface/models/antelopev2` directory.
The **main model** can be downloaded from [HuggingFace](https://huggingface.co/InstantX/InstantID/resolve/main/ip-adapter.bin?download=true) and should be placed into the `ComfyUI/models/instantid` directory. (Note that the model is called *ip_adapter* as it is based on the [IPAdapter](https://github.com/tencent-ailab/IP-Adapter)).
You also needs a [controlnet](https://huggingface.co/InstantX/InstantID/resolve/main/ControlNetModel/diffusion_pytorch_model.safetensors?download=true), place it in the ComfyUI controlnet directory.
**Remember at the moment this is only for SDXL.**
## Watermarks!
The training data is full of watermarks, to avoid them to show up in your generations use a resolution slightly different from 1024×1024 (or the standard ones) for example **1016×1016** works pretty well.
## Lower the CFG!
It's important to lower the CFG to at least 4/5 or you can use the `RescaleCFG` node.
## Face keypoints
The person is posed based on the keypoints generated from the reference image. You can use a different pose by sending an image to the `image_kps` input.
<img src="examples/daydreaming.jpg" width="386" height="386" alt="Day Dreaming" />
## Noise Injection
The default InstantID implementation seems to really burn the image, I find that by injecting noise to the negative embeds we can mitigate the effect and also increase the likeliness to the reference. The default Apply InstantID node automatically injects 35% noise, if you want to fine tune the effect you can use the Advanced InstantID node.
This is still experimental and may change in the future.
## Additional Controlnets
You can add more controlnets to the generation. An example workflow for depth controlnet is provided.
## Styling with IPAdapter
It's possible to style the composition with IPAdapter. An example is provided.
<img src="examples/instant_id_ipadapter.jpg" width="512" alt="IPAdapter" />
## Multi-ID
Multi-ID is supported but the workflow is a bit complicated and the generation slower. I'll check if I can find a better way of doing it. The "hackish" workflow is provided in the example directory.
<img src="examples/instantid_multi_id.jpg" width="768" alt="IPAdapter" />
## Advanced Node
There's an InstantID advanced node available, at the moment the only difference with the standard one is that you can set the weights for the instantID models and the controlnet separately. It now also includes a noise injection option. It might be helpful for finetuning.
The instantID model influences the composition of about 25%, the rest is the controlnet.
The noise helps reducing the "burn" effect.
## Other notes
It works very well with SDXL Turbo/Lighting. Best results with community's checkpoints.
## Current sponsors
It's only thanks to generous sponsors that **the whole community** can enjoy open and free software. Please join me in thanking the following companies and individuals!
### :trophy: Gold sponsors
[![Kaiber.ai](https://f.latent.vision/imgs/kaiber.png)](https://kaiber.ai/) [![InstaSD](https://f.latent.vision/imgs/instasd.png)](https://www.instasd.com/)
### :tada: Silver sponsors
[![OperArt.ai](https://f.latent.vision/imgs/openart.png?r=1)](https://openart.ai/workflows) [![Finetuners](https://f.latent.vision/imgs/finetuners.png)](https://www.finetuners.ai/) [![Comfy.ICU](https://f.latent.vision/imgs/comfyicu.png?r=1)](https://comfy.icu/)
### Other companies supporting my projects
- [RunComfy](https://www.runcomfy.com/) (ComfyUI Cloud)
### Esteemed individuals
- [Øystein Ø. Olsen](https://github.com/FireNeslo)
- [Jack Gane](https://github.com/ganeJackS)
- [Nathan Shipley](https://www.nathanshipley.com/)
- [Dkdnzia](https://github.com/Dkdnzia)
[And all my public and private sponsors!](https://github.com/sponsors/cubiq)
|