[KIT] Music to Image • v1 - a fffiloni Collection

fffiloni 's Collections

Video Understanding & Segmentation

Colorization Tasks

LipSync and Face Operations

🚂 SD-XL Training Suite

🎵 The MusicBox

Sora Reference Papers

🕹️ AI Games

Text-to-Image History

🎦🔀 Useful Tiny Video Converters

Historic Top Trending Demos

Video History [WIP]

The ControlNet Saga

3D Modelization

[KIT] Music to Image • v1

UpScale / Enhancers

[KIT] Music to Image • v1

updated Oct 3, 2024

Everything you need to reproduce my Music-to-Image demo

Running

156

🎵🎵🎵

Lp Music Caps

Note Will describe music mood
Runtime error

201

⚡

Demucs

Note Optional: will separate different audio tracks; used here to get song voice only which is then passed through whisper
Runtime error

187

🤫

Whisper Large V2

Note Whisper will transcribe lyrics
meta-llama/Llama-2-7b

Text Generation • Updated Apr 17, 2024 • 4.23k

Note Llama is the major part: will use LP-Music-Cap + optional lyrics transcription to write an image description that should match your music input, according to the previous steps
stabilityai/stable-diffusion-xl-base-1.0

Text-to-Image • Updated Oct 30, 2023 • 2.6M • 6.24k

Note Llama just gave an image description, use it to generate an image with SDXL model
Paused

265

🎶🌅

Music To Image