flymin's picture
Update README.md
1ea6ea5 verified
---
license: gpl-3.0
language:
- en
tags:
- magicdrive
- video-generation
- autonomous-driving
---
# MagicDriveDiT
<p style="text-align: center;">
<p align="center">
<a href="https://arxiv.org/abs/2411.13807">πŸ“„ Paper</a> |
<a href="https://gaoruiyuan.com/magicdrivedit/">🌐 Website</a> |
<a href="https://github.com/flymin/MagicDriveDiT/blob/main/LICENSE">πŸ“– LICENSE</a> |
<a href="https://github.com/flymin/MagicDriveDiT">πŸ€– GitHub</a>
</p>
This repository contains the model checkpoint of the paper.
This model is fine-tuned for 10k steps with lr=1e-5 after 40k steps of training for stage-3.
Note:
- For inference/testing, we recommend the ema model, i.e., `ema.pt`.
- For fine-tuning, you can consider to use the training checkpoint, i.e., `model/*`
> MagicDriveDiT: High-Resolution Long Video Generation for Autonomous Driving with Adaptive Control <br>
> [Ruiyuan Gao](https://gaoruiyuan.com/)<sup>1</sup>, [Kai Chen](https://kaichen1998.github.io/)<sup>2</sup>, [Bo Xiao](https://www.linkedin.com/in/bo-xiao-19909955/?originalSubdomain=ie)<sup>3</sup>, [Lanqing Hong](https://scholar.google.com.sg/citations?user=2p7x6OUAAAAJ&hl=en)<sup>4</sup>, [Zhenguo Li](https://scholar.google.com/citations?user=XboZC1AAAAAJ&hl=en)<sup>4</sup>, [Qiang Xu](https://cure-lab.github.io/)<sup>1</sup><br>
> <sup>1</sup>CUHK <sup>2</sup>HKUST <sup>3</sup>Huawei Cloud <sup>4</sup>Huawei Noah's Ark Lab
<div style="text-align: center;">
<video width="100%" controls style="margin: auto; max-width:1080px">
<source src="https://github.com/user-attachments/assets/f43812ea-087b-4b70-883b-1e2f1c0df8d7" type="video/mp4">
</video>
</div>
Please find more information on our GitHub: https://github.com/flymin/MagicDriveDiT