---
language:
- en
license: apache-2.0
tags:
- hearing loss
- challenge
- signal processing
- source separation
- audio
- audio-to-audio
- NonCausal
---
# Cadenza Challenge: CAD2-Task2
A non-causal flute/others source separation model for the CAD2-Task2 baseline system.
* Architecture: ConvTasNet (Kaituo XU) with multichannel support (Alexandre Defossez).
* Parameters:
  * B: 256
  * C: 2
  * H: 512
  * L: 20
  * N: 256
  * P: 3
  * R: 3
  * X: 8
  * audio_channels: 2
  * causal: false
  * mask_nonlinear: relu
  * norm_type: gLN
* training:
  * sample_rate: 44100
  * samples_per_track: 64
  * segment: 5.0
  * aggregate: 2
  * batch_size: 4
  * early_stop: true
  * epochs: 200
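
To give a feel for what these hyperparameters imply, the following is a rough sketch that derives the number of encoder frames in one training segment and the temporal receptive field of the separator. It rests on two assumptions that this card does not state: that the encoder stride is `L // 2` and that the `X` convolutional blocks in each repeat use dilations `1, 2, ..., 2**(X-1)`, as in the reference Conv-TasNet design. The variable names below are illustrative and are not part of the model's API.

```python
# Values copied from the parameter list in this card.
L = 20               # encoder filter length, in samples
P = 3                # kernel size of the temporal convolution blocks
X = 8                # convolution blocks per repeat
R = 3                # number of repeats
sample_rate = 44100  # Hz
segment = 5.0        # training segment length, in seconds

# Assumption: the encoder hop is half the filter length (50% overlap).
hop = L // 2

# Number of encoder frames produced for one training segment (no padding).
segment_samples = int(segment * sample_rate)
frames_per_segment = (segment_samples - L) // hop + 1

# Assumption: block x in each repeat has dilation 2**x, so each block
# widens the receptive field by (P - 1) * 2**x frames.
receptive_field_frames = 1 + R * sum((P - 1) * 2**x for x in range(X))

# Convert the receptive field from encoder frames to samples and seconds.
receptive_field_samples = (receptive_field_frames - 1) * hop + L
receptive_field_seconds = receptive_field_samples / sample_rate

print(frames_per_segment, receptive_field_frames, round(receptive_field_seconds, 3))
```

Under these assumptions, each output frame depends on roughly a third of a second of audio. Because the model is non-causal (`causal: false`), that context would be spread across both past and future samples.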
## Dataset
The model was trained on the EnsembleSet and CadenzaWoodwind datasets.
## How to use
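```
# `tasnet` is a local module that defines ConvTasNetStereo; it must be importable.
from tasnet import ConvTasNetStereo

# Load the pretrained weights from the Hugging Face Hub and place the model on the CPU.
model = ConvTasNetStereo.from_pretrained(
    "cadenzachallenge/ConvTasNet_Flute_NonCausal"
).cpu()
```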