Apply for community grant: Academic project (gpu)

#1
by crypto-code - opened
Music Understanding and Generation org
โ€ข
edited Jan 3

The M2UGen (https://arxiv.org/abs/2311.11255) model is a Music Understanding and Generation model that is capable of Music Question Answering and also Music Generation from texts, images, videos and audios, as well as Music Editing. The model utilizes encoders such as MERT for music understanding, ViT for image understanding and ViViT for video understanding and the MusicGen/AudioLDM2 model as the music generation model (music decoder), coupled with adapters and the LLaMA 2 model to make the model capable of multiple abilities.

We would like to apply for the GPU grant for two A10G GPUs in order to make our model accessible to the HuggingFace community, helping users generate music from texts, images, videos and audios.

Hi @crypto-code , we have assigned a gpu to this space. Note that GPU Grants are provided temporarily and might be removed after some time if the usage is very low.

To learn more about GPUs in Spaces, please check out https://huggingface.co/docs/hub/spaces-gpus

Music Understanding and Generation org

Thank you @hysts , We will have the demo up and running asap.

It would be amazing to have some examples so users can quickly get started!

Sign up or log in to comment