hbwu-ntu's repositories
AdvAttacksASVspoof
This is the implementation of the paper "Adversarial Attacks on Spoofing Countermeasures of automatic speaker verification".
AudioDecBenchmark
Audio Codec Benchmark
ECAPA-TDNN
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)
hbwu-ntu.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
hbwu.github.io
Haibin's homepage
speech-trident
Awesome speech/audio LLMs, representation learning, and codec models
audiowmark
Audio Watermarking
CMGAN
Conformer-based Metric GAN for speech enhancement
dns_mos_calculate
Code for calculate DNS_MOS.
dynamic-superb
The official repository of Dynamic-SUPERB.
FullSubNet-plus
The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".
GaGNet
This repo provides the network code and the processed samples of the manuscript "Glance and Gaze: A Collaborative Learning Framework for Single-channel Speech Enhancement", which was accepted by Elsevier Applied Acoustics.
InstaFlow
:zap: InstaFlow! One-Step Stable Diffusion with Rectified Flow
Loss-Gated-Learning
ICASSP 2022: 'Self-supervised Speaker Recognition with Loss-gated Learning'
Matcha-TTS
🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
MetricGAN-OKD
Official PyTorch implementation of "Multi-Metric Optimization of MetricGAN via Online Knowledge Distillation for Speech Enhancement" (ICML 2023)
ML2021-Spring
**Official** 李宏毅 (Hung-yi Lee) 機器學習 Machine Learning 2021 Spring
MTFAA-Net
Multi-Scale Temporal Frequency Convolutional Network With Axial Attention for Speech Enhancement
mvp
NeurIPS-2021: Direct Multi-view Multi-person 3D Human Pose Estimation
s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
SE-TFCN
语音增强TFCN论文复现
SLAM-LLM
Speech, Language, Audio, Music Processing with Large Language Model
wavmark
AI-based Audio Watermarking Tool