hbwu-ntu

hbwu-ntu

Geek Repo

Company:National Taiwan University

Location:Seattle, WA, US

Home Page:https://hbwu-ntu.github.io/

Github PK Tool:Github PK Tool


Organizations
s3prl

hbwu-ntu's repositories

MP-SENet

MP-SENet: A Speech Enhancement Model with Parallel Denoising of Magnitude and Phase Spectra

Language:PythonLicense:MITStargazers:5Issues:0Issues:0

AdvAttacksASVspoof

This is the implementation of the paper "Adversarial Attacks on Spoofing Countermeasures of automatic speaker verification".

Language:PythonStargazers:1Issues:0Issues:0

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

AudioDecBenchmark

Audio Codec Benchmark

Language:PythonStargazers:1Issues:0Issues:0

ECAPA-TDNN

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

FAcodec

Training code for FAcodec presented in NaturalSpeech3

Stargazers:1Issues:0Issues:0

hbwu-ntu.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language:JavaScriptLicense:MITStargazers:1Issues:0Issues:0

hbwu.github.io

Haibin's homepage

Language:JavaScriptLicense:MITStargazers:1Issues:0Issues:0

speech-trident

Awesome speech/audio LLMs, representation learning, and codec models

Stargazers:1Issues:0Issues:0

UniSpeech

UniSpeech - Large Scale Self-Supervised Learning for Speech

Language:PythonLicense:NOASSERTIONStargazers:1Issues:0Issues:0

audiowmark

Audio Watermarking

Language:C++License:GPL-3.0Stargazers:0Issues:0Issues:0

CMGAN

Conformer-based Metric GAN for speech enhancement

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:1Issues:0

dns_mos_calculate

Code for calculate DNS_MOS.

Language:PythonStargazers:0Issues:0Issues:0

dynamic-superb

The official repository of Dynamic-SUPERB.

Language:PythonStargazers:0Issues:0Issues:0

FullSubNet-plus

The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

GaGNet

This repo provides the network code and the processed samples of the manuscript "Glance and Gaze: A Collaborative Learning Framework for Single-channel Speech Enhancement", which was accepted by Elsevier Applied Acoustics.

Language:PythonStargazers:0Issues:0Issues:0

InstaFlow

:zap: InstaFlow! One-Step Stable Diffusion with Rectified Flow

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Loss-Gated-Learning

ICASSP 2022: 'Self-supervised Speaker Recognition with Loss-gated Learning'

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Matcha-TTS

🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

MetricGAN-OKD

Official PyTorch implementation of "Multi-Metric Optimization of MetricGAN via Online Knowledge Distillation for Speech Enhancement" (ICML 2023)

License:Apache-2.0Stargazers:0Issues:0Issues:0

ML2021-Spring

**Official** 李宏毅 (Hung-yi Lee) 機器學習 Machine Learning 2021 Spring

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

MTFAA-Net

Multi-Scale Temporal Frequency Convolutional Network With Axial Attention for Speech Enhancement

Language:PythonStargazers:0Issues:0Issues:0

mvp

NeurIPS-2021: Direct Multi-view Multi-person 3D Human Pose Estimation

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

SE-TFCN

语音增强TFCN论文复现

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

SLAM-LLM

Speech, Language, Audio, Music Processing with Large Language Model

License:MITStargazers:0Issues:0Issues:0

wavmark

AI-based Audio Watermarking Tool

License:MITStargazers:0Issues:0Issues:0