Text-to-Audio
Transformers
English
Inference Endpoints
tango / vae_config.json
declare-lab-sutd's picture
First TANGO Version
bf778ea
raw
history blame
326 Bytes
{"image_key": "fbank", "subband": 1, "embed_dim": 8, "time_shuffle": 1, "ddconfig": {"double_z": true, "z_channels": 8, "resolution": 256, "downsample_time": false, "in_channels": 1, "out_ch": 1, "ch": 128, "ch_mult": [1, 2, 4], "num_res_blocks": 2, "attn_resolutions": [], "dropout": 0.0}, "scale_factor": 0.9227914214134216}