|
--- |
|
license: gpl-3.0 |
|
language: |
|
- en |
|
tags: |
|
- magicdrive |
|
- video-generation |
|
- autonomous-driving |
|
--- |
|
|
|
# MagicDriveDiT |
|
|
|
<p style="text-align: center;"> |
|
<p align="center"> |
|
<a href="https://arxiv.org/abs/2411.13807">π Paper</a> | |
|
<a href="https://gaoruiyuan.com/magicdrivedit/">π Website</a> | |
|
<a href="https://github.com/flymin/MagicDriveDiT/blob/main/LICENSE">π LICENSE</a> | |
|
<a href="https://github.com/flymin/MagicDriveDiT">π€ GitHub</a> |
|
</p> |
|
|
|
This repository contains the model checkpoint of the paper. |
|
|
|
This model is fine-tuned for 10k steps with lr=1e-5 after 40k steps of training for stage-3. |
|
|
|
Note: |
|
- For inference/testing, we recommend the ema model, i.e., `ema.pt`. |
|
- For fine-tuning, you can consider to use the training checkpoint, i.e., `model/*` |
|
|
|
> MagicDriveDiT: High-Resolution Long Video Generation for Autonomous Driving with Adaptive Control <br> |
|
> [Ruiyuan Gao](https://gaoruiyuan.com/)<sup>1</sup>, [Kai Chen](https://kaichen1998.github.io/)<sup>2</sup>, [Bo Xiao](https://www.linkedin.com/in/bo-xiao-19909955/?originalSubdomain=ie)<sup>3</sup>, [Lanqing Hong](https://scholar.google.com.sg/citations?user=2p7x6OUAAAAJ&hl=en)<sup>4</sup>, [Zhenguo Li](https://scholar.google.com/citations?user=XboZC1AAAAAJ&hl=en)<sup>4</sup>, [Qiang Xu](https://cure-lab.github.io/)<sup>1</sup><br> |
|
> <sup>1</sup>CUHK <sup>2</sup>HKUST <sup>3</sup>Huawei Cloud <sup>4</sup>Huawei Noah's Ark Lab |
|
|
|
<div style="text-align: center;"> |
|
<video width="100%" controls style="margin: auto; max-width:1080px"> |
|
<source src="https://github.com/user-attachments/assets/f43812ea-087b-4b70-883b-1e2f1c0df8d7" type="video/mp4"> |
|
</video> |
|
</div> |
|
|
|
Please find more information on our GitHub: https://github.com/flymin/MagicDriveDiT |
|
|