StreamingSVD

StreamingSVD: Consistent, Dynamic, and Extendable Image-Guided Long Video Generation
Roberto Henschel, Levon Khachatryan, Daniil Hayrapetyan, Hayk Poghosyan, Vahram Tadevosyan, Zhangyang Wang, Shant Navasardyan, Humphrey Shi

Video | Project page | Code

🔥 Meet StreamingSVD - A StreamingT2V Method

StreamingSVD is an advanced autoregressive technique for image-to-video generation, generating long hiqh-quality videos with rich motion dynamics, turning SVD into a long video generator. Our method ensures temporal consistency throughout the video, aligns closely to the input image, and maintains high frame-level image quality. Our demonstrations include successful examples of videos up to 200 frames, spanning 8 seconds, and can be extended for even longer durations. The effectiveness of the underlying autoregressive approach is not limited to the specific base model used, indicating that improvements in base models can yield even higher-quality videos. StreamingSVD is part of the StreamingT2V family.

BibTeX

If you use our work in your research, please cite our publications:

StreamingSVD paper comming soon.

@article{henschel2024streamingt2v,
  title={StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text},
  author={Henschel, Roberto and Khachatryan, Levon and Hayrapetyan, Daniil and Poghosyan, Hayk and Tadevosyan, Vahram and Wang, Zhangyang and Navasardyan, Shant and Shi, Humphrey},
  journal={arXiv preprint arXiv:2403.14773},
  year={2024}
}

Downloads last month: -; Downloads are not tracked for this model. How to track

Safetensors

Model size

3B params

Tensor type

F32

Model tree for PAIR/StreamingSVD

Finetunes

1 model

Paper for PAIR/StreamingSVD

StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text

Paper • 2403.14773 • Published Mar 21, 2024 • 11