# Genshin Impact Zhongli & Ningguang HunyuanVideo LoRA

This repository contains the setup and scripts needed to generate videos with the HunyuanVideo model, using a LoRA (Low-Rank Adaptation) fine-tuned for Genshin Impact's characters Zhongli and Ningguang. Below are the instructions to install dependencies, download the models, and run the demos.

---

## Installation

### Step 1: Install System Dependencies
Run the following command to install the required system packages:
```bash
sudo apt-get update && sudo apt-get install git-lfs ffmpeg cbm
```
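
Here, `git-lfs` is needed to pull the large model files cloned in the next step, `ffmpeg` handles video encoding, and `cbm` is a console bandwidth monitor that is handy (but optional) for watching the downloads. A quick sanity check that the first two are ready:

```bash
git lfs install            # registers the Git LFS filters used by the clone below
ffmpeg -version | head -n 1
```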

### Step 2: Clone the Repository
Clone the repository and navigate to the project directory:
```bash
git clone https://huggingface.co/svjack/Genshin_Impact_ZhongLi_NingGuang_Couple_HunyuanVideo_lora
cd Genshin_Impact_ZhongLi_NingGuang_Couple_HunyuanVideo_lora
```
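
The LoRA weights used by the demo commands (`zhongli_ningguang_couple_im_lora_dir/zhongli_ningguang_couple_im_lora-000012.safetensors`) are stored in Git LFS. If the clone left only small pointer files behind (for example because `git lfs install` was skipped), fetch the real files:

```bash
git lfs pull
```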

### Step 3: Install Python Dependencies
Install the required Python packages:
```bash
conda create -n py310 python=3.10
conda activate py310
pip install ipykernel
python -m ipykernel install --user --name py310 --display-name "py310"

pip install -r requirements.txt
pip install ascii-magic matplotlib tensorboard huggingface_hub
pip install moviepy==1.0.3
pip install sageattention==1.0.6
```
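
Before downloading tens of gigabytes of weights, it is worth confirming that PyTorch (assumed to be pulled in via `requirements.txt`) can see your GPU:

```bash
python -c "import torch; print(torch.__version__, torch.cuda.is_available())"
```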

---

## Download Models

### Step 1: Download HunyuanVideo Model
Download the HunyuanVideo model and place it in the `ckpts` directory:
```bash
huggingface-cli download tencent/HunyuanVideo --local-dir ./ckpts
```
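
The demo commands below reference two files from this download. A quick check that they landed where the `--dit` and `--vae` flags expect them:

```bash
ls -lh ckpts/hunyuan-video-t2v-720p/transformers/mp_rank_00_model_states.pt \
       ckpts/hunyuan-video-t2v-720p/vae/pytorch_model.pt
```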

### Step 2: Download LLaVA Model
Download the LLaVA model and preprocess it into the text encoder used by the generation script:
```bash
cd ckpts
huggingface-cli download xtuner/llava-llama-3-8b-v1_1-transformers --local-dir ./llava-llama-3-8b-v1_1-transformers
wget https://raw.githubusercontent.com/Tencent/HunyuanVideo/refs/heads/main/hyvideo/utils/preprocess_text_encoder_tokenizer_utils.py
python preprocess_text_encoder_tokenizer_utils.py --input_dir llava-llama-3-8b-v1_1-transformers --output_dir text_encoder
```

### Step 3: Download CLIP Model
Still inside `ckpts`, download the CLIP model used as the second text encoder:
```bash
huggingface-cli download openai/clip-vit-large-patch14 --local-dir ./text_encoder_2
```
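
Steps 2 and 3 run from inside `ckpts`, while the demo commands below use paths relative to the repository root, so step back out before continuing:

```bash
cd ..
```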

---

## Demo

### Generate Video 1: Zhongli & Ningguang Cooking Rice
Run the following command from the repository root to generate a video of Zhongli and Ningguang cooking rice:
```bash
python hv_generate_video.py \
    --fp8 \
    --video_size 544 960 \
    --video_length 60 \
    --infer_steps 30 \
    --prompt "ZHONGLI\\(genshin impact\\) with NING GUANG\\(genshin impact\\) in red cheongsam. cook rice in a pot" \
    --save_path . \
    --output_type both \
    --dit ckpts/hunyuan-video-t2v-720p/transformers/mp_rank_00_model_states.pt \
    --attn_mode sdpa \
    --vae ckpts/hunyuan-video-t2v-720p/vae/pytorch_model.pt \
    --vae_chunk_size 32 \
    --vae_spatial_tile_sample_min_size 128 \
    --text_encoder1 ckpts/text_encoder \
    --text_encoder2 ckpts/text_encoder_2 \
    --seed 1234 \
    --lora_multiplier 1.0 \
    --lora_weight zhongli_ningguang_couple_im_lora_dir/zhongli_ningguang_couple_im_lora-000012.safetensors
```

<video controls autoplay src="https://cdn-uploads.huggingface.co/production/uploads/634dffc49b777beec3bc6448/OHfEzY_Ernp778vMC-K8D.mp4"></video>
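
The demos use `--output_type both`, which should write the latents alongside the finished video. For a lightweight, shareable preview of the video itself, the `ffmpeg` installed in Step 1 can convert it to a GIF; the input filename below is a placeholder, so substitute whatever file the script wrote to `--save_path`:

```bash
# output.mp4 is a placeholder; use the actual file written to --save_path
ffmpeg -i output.mp4 -vf "fps=10,scale=480:-1" preview.gif
```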

### Generate Video 2: Zhongli Drinking Tea
Run the following command to generate a video of Zhongli drinking tea:
```bash
python hv_generate_video.py \
    --fp8 \
    --video_size 544 960 \
    --video_length 60 \
    --infer_steps 30 \
    --prompt "ZHONGLI\\(genshin impact\\). drink tea" \
    --save_path . \
    --output_type both \
    --dit ckpts/hunyuan-video-t2v-720p/transformers/mp_rank_00_model_states.pt \
    --attn_mode sdpa \
    --vae ckpts/hunyuan-video-t2v-720p/vae/pytorch_model.pt \
    --vae_chunk_size 32 \
    --vae_spatial_tile_sample_min_size 128 \
    --text_encoder1 ckpts/text_encoder \
    --text_encoder2 ckpts/text_encoder_2 \
    --seed 1234 \
    --lora_multiplier 1.0 \
    --lora_weight zhongli_ningguang_couple_im_lora_dir/zhongli_ningguang_couple_im_lora-000012.safetensors
```

---

## Notes
- Ensure you have sufficient GPU resources for video generation.
- Adjust the `--video_size`, `--video_length`, and `--infer_steps` parameters to trade off output quality, length, and runtime; see the example after this list.
- Modify the `--prompt` parameter to generate videos with different scenes or actions.
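
As a concrete example of the second note, a quicker, lower-resolution run for iterating on prompts might look like this. Only the size, length, and step values differ from the demos above, and the numbers are illustrative rather than tuned:

```bash
python hv_generate_video.py \
    --fp8 \
    --video_size 320 576 \
    --video_length 30 \
    --infer_steps 20 \
    --prompt "ZHONGLI\\(genshin impact\\). drink tea" \
    --save_path . \
    --output_type both \
    --dit ckpts/hunyuan-video-t2v-720p/transformers/mp_rank_00_model_states.pt \
    --attn_mode sdpa \
    --vae ckpts/hunyuan-video-t2v-720p/vae/pytorch_model.pt \
    --vae_chunk_size 32 \
    --vae_spatial_tile_sample_min_size 128 \
    --text_encoder1 ckpts/text_encoder \
    --text_encoder2 ckpts/text_encoder_2 \
    --seed 1234 \
    --lora_multiplier 1.0 \
    --lora_weight zhongli_ningguang_couple_im_lora_dir/zhongli_ningguang_couple_im_lora-000012.safetensors
```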

---