Everything you need to reproduce my Music-to-Image demo
Note Describes the mood of the music
Note Optional: separates the audio into individual tracks; used here to isolate the vocals, which are then passed to Whisper
Note Whisper transcribes the lyrics
Note Llama is the core step: it takes the LP-Music-Cap output plus the optional lyrics transcription and writes an image description that should match your music input
Note Pass the image description written by Llama to the SDXL model to generate the image
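The notes above describe a chain of steps, which can be sketched roughly as follows. This is only an illustration of the data flow, not the demo's actual code: `Stages`, `build_llm_instruction`, `music_to_image`, and the wording of the instruction sent to the language model are all hypothetical names and choices made for this example. Each stage is a plain callable so you can plug in whichever model wrapper you use for that step.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class Stages:
    """Hypothetical stand-ins for the models named in the notes."""
    caption_music: Callable[[str], str]        # e.g. LP-Music-Cap: audio path -> mood caption
    transcribe: Callable[[str], str]           # e.g. Whisper: audio path -> lyrics text
    write_image_prompt: Callable[[str], str]   # e.g. Llama: instruction -> image description
    generate_image: Callable[[str], object]    # e.g. SDXL: image description -> image
    separate_vocals: Optional[Callable[[str], str]] = None  # optional: audio path -> vocals path


def build_llm_instruction(caption: str, lyrics: Optional[str]) -> str:
    """Combine the music caption and optional lyrics into one instruction (assumed wording)."""
    parts = [
        "Write a one-paragraph description of an image that matches this music.",
        f"Music description: {caption}",
    ]
    if lyrics:
        parts.append(f"Lyrics: {lyrics}")
    return "\n".join(parts)


def music_to_image(audio_path: str, stages: Stages, use_lyrics: bool = True):
    """Run the steps in order and return (image, image_description)."""
    # Step 1: describe the mood of the music.
    caption = stages.caption_music(audio_path)

    # Steps 2-3 (optional): isolate the vocals if a separator is provided, then transcribe.
    lyrics = None
    if use_lyrics:
        vocals_path = stages.separate_vocals(audio_path) if stages.separate_vocals else audio_path
        lyrics = stages.transcribe(vocals_path).strip() or None

    # Step 4: ask the language model for an image description.
    image_description = stages.write_image_prompt(build_llm_instruction(caption, lyrics))

    # Step 5: generate the image from that description.
    return stages.generate_image(image_description), image_description
```

To try the flow without downloading any model, pass simple functions that return fixed strings for each stage, then replace them one at a time with real model calls.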