Create music captions from audio files
Generate music from text prompts
Generate chat responses from user input