hbwu-ntu

hbwu-ntu's repositories

MP-SENet

MP-SENet: A Speech Enhancement Model with Parallel Denoising of Magnitude and Phase Spectra

Language:PythonMIT500

AdvAttacksASVspoof

This is the implementation of the paper "Adversarial Attacks on Spoofing Countermeasures of automatic speaker verification".

Language:Python100

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language:PythonMIT100

AudioDecBenchmark

Audio Codec Benchmark

Language:Python100

ECAPA-TDNN

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)

Language:PythonMIT100

FAcodec

Training code for FAcodec presented in NaturalSpeech3

100

hbwu-ntu.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language:JavaScriptMIT100

hbwu.github.io

Haibin's homepage

Language:JavaScriptMIT100

speech-trident

Awesome speech/audio LLMs, representation learning, and codec models

100

UniSpeech

UniSpeech - Large Scale Self-Supervised Learning for Speech

Language:PythonNOASSERTION100

audiowmark

Audio Watermarking

Language:C++GPL-3.0000

CMGAN

Conformer-based Metric GAN for speech enhancement

Language:PythonMIT000

DistillLoss

Language:Python010

dns_mos_calculate

Code for calculate DNS_MOS.

Language:Python000

dynamic-superb

The official repository of Dynamic-SUPERB.

Language:Python000

FullSubNet-plus

The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".

Language:PythonApache-2.0000

GaGNet

This repo provides the network code and the processed samples of the manuscript "Glance and Gaze: A Collaborative Learning Framework for Single-channel Speech Enhancement", which was accepted by Elsevier Applied Acoustics.

Language:Python000