Rpersie / onssen

An open-source speech separation and enhancement library

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

ONSSEN: An Open-source Speech Separation and Enhancement Library

Supported Models

  • Deep Clustering
  • Chimera Net
  • Chimera++
  • Phase Estimation Network
  • Speech Enhancement with Restoration Layers

Supported Dataset

Requirements

  • PyTorch
  • LibRosa
  • NumPy

Usage

You can simply use the existing config JSON file or customize your config file to train the enhancement or separation model.

python train.py -c configs/dc_config.json

Citing

If you use onssen for your research project, please cite one of the following bibtex citations:

@inproceedings {onssen,
    author = {Zhaoheng Ni and Michael Mandel},
    title = "ONSSEN: An Open-source Speech Separation and Enhancement Library",
    publisher = "under review",
    year = 2019
}

@Misc{onssen,
    author = {Zhaoheng Ni and Michael Mandel},
    title = "ONSSEN: An Open-source Speech Separation and Enhancement Library",
    howpublished = {\url{https://github.com/speechLabBcCuny/onssen}},
    year =        {2019}
}

About

An open-source speech separation and enhancement library

License:GNU General Public License v3.0


Languages

Language:Python 100.0%