KALEIDO: OPEN-SOURCED MULTI-SUBJECT REFERENCE VIDEO GENERATION MODEL

This repository contains the official implementation of Kaleido, proposed in our paper:

Update and News

  • 2025.10.28: πŸ”₯ We release the checkpoints of Kaleido-14B-S2V.
  • 2025.10.22: πŸ”₯ We propose Kaleido, a novel multi-subject reference video generation model.

Checkpoints Download

ckpts Download Link Notes
Kaleido-14B πŸ€— Huggingface Supports 512P

Use the following commands to download the model weights (We have integrated both Wan VAE and T5 modules into this checkpoint for convenience).

# Download the repository (skip automatic LFS file downloads)
GIT_LFS_SKIP_SMUDGE=1 git clone https://huggingface.co/Crilias/Kaleido-14B-S2V

# Enter the repository folder
cd Kaleido-14B-S2V

# Merge the checkpoint files
python merge_kaleido.py

Arrange the model files into the following structure:

.
β”œβ”€β”€ Kaleido-14B-S2V
β”‚   β”œβ”€β”€ model
β”‚   β”‚   └── ....
β”‚   β”œβ”€β”€ Wan2.1_VAE.pth
β”‚   β”‚
β”‚   └── umt5-xxl
β”‚       └── ....
β”œβ”€β”€ configs
β”œβ”€β”€ sat
└── sgm

Citation

If you find our work helpful, please cite our paper:

@misc{zhang2025kaleidoopensourcedmultisubjectreference,
      title={Kaleido: Open-Sourced Multi-Subject Reference Video Generation Model}, 
      author={Zhenxing Zhang and Jiayan Teng and Zhuoyi Yang and Tiankun Cao and Cheng Wang and Xiaotao Gu and Jie Tang and Dan Guo and Meng Wang},
      year={2025},
      eprint={2510.18573},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2510.18573}, 
}
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support