Create README.md
README.md
ADDED
@@ -0,0 +1,50 @@
---
language:
- English
tags:
- MusicGeneration
---

<!--Copyright 2020 The HuggingFace Team. All rights reserved.

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with
the License. You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on
an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the
specific language governing permissions and limitations under the License.
-->

# Jukebox

## Overview

The Jukebox model was proposed in [Jukebox: A Generative Model for Music](https://arxiv.org/pdf/2005.00341.pdf)
by Prafulla Dhariwal, Heewoo Jun, Christine Payne, Jong Wook Kim, Alec Radford, and
Ilya Sutskever.

Jukebox is a generative music model that can produce minute-long samples and can be conditioned on
artist, genre, and lyrics.

The abstract from the paper is the following:

We introduce Jukebox, a model that generates music with singing in the raw audio domain. We tackle the long context
of raw audio using a multiscale VQ-VAE to compress it to discrete codes, and modeling those using autoregressive
Transformers. We show that the combined model at scale can generate high-fidelity and diverse songs with coherence up
to multiple minutes. We can condition on artist and genre to steer the musical and vocal style, and on unaligned
lyrics to make the singing more controllable. We are releasing thousands of non cherry-picked samples, along with
model weights and code.

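To make "compress it to discrete codes" more concrete, here is a toy NumPy sketch of the nearest-neighbour codebook lookup at the core of a VQ-VAE. It is not taken from the Jukebox code: the function name `quantize`, the codebook values, and all sizes are illustrative assumptions, and the real model learns its codebooks and encoders at several temporal resolutions.

```python
import numpy as np


def quantize(latents: np.ndarray, codebook: np.ndarray):
    """Map each latent vector to the index of its nearest codebook entry.

    latents:  (T, D) array of encoder outputs (hypothetical shapes).
    codebook: (K, D) array of code vectors.
    """
    # Squared Euclidean distance between every latent and every code: (T, K).
    dists = ((latents[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=-1)
    codes = dists.argmin(axis=1)   # (T,) integer codes
    return codes, codebook[codes]  # codes and their dequantized vectors


# Toy codebook (assumption): code k is the 4-dimensional vector [k, k, k, k].
codebook = np.arange(8, dtype=float)[:, None] * np.ones((1, 4))

# Three latents that sit slightly off codes 3, 0 and 5.
latents = codebook[[3, 0, 5]] + 0.1

codes, dequantized = quantize(latents, codebook)
print(codes)
```

In a setup like this, an autoregressive model would be trained on the integer `codes` sequence rather than on the raw waveform, which is what shortens the context it has to handle.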

Tips:

This model is currently very slow: generating one minute of audio takes 18 hours.

This model was contributed by [Arthur Zucker](https://huggingface.co/ArthurZ).
The original code can be found [here](https://github.com/openai/jukebox).