flymin
/

MagicDriveDiT-stage3-40k-ft

video-generation

autonomous-driving

Model card Files Files and versions Community

MagicDriveDiT-stage3-40k-ft / README.md

flymin's picture

Update README.md

1ea6ea5 verified about 2 months ago

|

history blame contribute delete

1.76 kB

	---
	license: gpl-3.0
	language:
	- en
	tags:
	- magicdrive
	- video-generation
	- autonomous-driving
	---

	# MagicDriveDiT

	<p style="text-align: center;">
	<p align="center">
	<a href="https://arxiv.org/abs/2411.13807">📄 Paper</a> \|
	<a href="https://gaoruiyuan.com/magicdrivedit/">🌐 Website</a> \|
	<a href="https://github.com/flymin/MagicDriveDiT/blob/main/LICENSE">📖 LICENSE</a> \|
	<a href="https://github.com/flymin/MagicDriveDiT">🤖 GitHub</a>
	</p>

	This repository contains the model checkpoint of the paper.

	This model is fine-tuned for 10k steps with lr=1e-5 after 40k steps of training for stage-3.

	Note:
	- For inference/testing, we recommend the ema model, i.e., `ema.pt`.
	- For fine-tuning, you can consider to use the training checkpoint, i.e., `model/*`

	> MagicDriveDiT: High-Resolution Long Video Generation for Autonomous Driving with Adaptive Control <br>
	> [Ruiyuan Gao](https://gaoruiyuan.com/)<sup>1</sup>, [Kai Chen](https://kaichen1998.github.io/)<sup>2</sup>, [Bo Xiao](https://www.linkedin.com/in/bo-xiao-19909955/?originalSubdomain=ie)<sup>3</sup>, [Lanqing Hong](https://scholar.google.com.sg/citations?user=2p7x6OUAAAAJ&hl=en)<sup>4</sup>, [Zhenguo Li](https://scholar.google.com/citations?user=XboZC1AAAAAJ&hl=en)<sup>4</sup>, [Qiang Xu](https://cure-lab.github.io/)<sup>1</sup><br>
	> <sup>1</sup>CUHK <sup>2</sup>HKUST <sup>3</sup>Huawei Cloud <sup>4</sup>Huawei Noah's Ark Lab

	<div style="text-align: center;">
	<video width="100%" controls style="margin: auto; max-width:1080px">
	<source src="https://github.com/user-attachments/assets/f43812ea-087b-4b70-883b-1e2f1c0df8d7" type="video/mp4">
	</video>
	</div>

	Please find more information on our GitHub: https://github.com/flymin/MagicDriveDiT