HudsonHuang

雲夢's repositories

waveglow_vocoder

A vocoder that converts audio to a mel-spectrogram and back using WaveGlow, with GPU support.

Language: Python · License: BSD-3-Clause · 12 3 3

yata

Yet Another Tools for Audio deep learning (for myself).

Language: Python · License: MIT · 6 30

ACA-Code

MATLAB scripts accompanying the book "An Introduction to Audio Content Analysis" (www.AudioContentAnalysis.org)

Language: MATLAB · License: MIT · 1 10

midi_merge

Merge two MIDI files into a single file with two tracks.

Language: Python · 1 20

aishell-3-baseline-fc-1

Code for the AISHELL-3 baseline acoustic model

Language: Jupyter Notebook · License: MIT · 010

AutoSpeech

The 1st place solution for AutoSpeech 2019.

Language: Python · License: GPL-3.0 · 010

autospeech19

3rd place solution for AutoSpeech 2019

Language: Python · License: MIT · 010

AutoSpeech2019

Solution for AutoSpeech Challenge 2019

Language: Python · License: Apache-2.0 · 010

autotuner

000

DCASE2020-Task1

Jupyter notebook for DCASE 2020 challenge Task 1

License: MIT · 000

DCASE2020_task1

Code for DCASE 2020 task 1a and task 1b.

Language: Python · License: MIT · 010

g2pM

A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset

Language: Python · License: Apache-2.0 · 010

Gender-Classification

Gender Classification of Speech Signals

Language: Jupyter Notebook · 010

Lenia

Lenia - Mathematical Life Forms

Language: Python · License: MIT · 020

Markdown-Resume-Template

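A resume template shared by a BAT programmer. A technical resume should be simple and clear, without unnecessary decoration. Fork it to your own repository and adapt the template from there.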

010

NLNL-Negative-Learning-for-Noisy-Labels

NLNL: Negative Learning for Noisy Labels

Language: Python · 020

nnAudio

Audio processing using PyTorch 1D convolution networks

Language: Jupyter Notebook · License: MIT · 020

OpenTransformer

A No-Recurrence Sequence-to-Sequence Model for Speech Recognition

Language: Python · License: MIT · 010

pase

Problem Agnostic Speech Encoder

000

pitch_jitter_shimmer

Use Praat to extract pitch, jitter, and shimmer parameters from a voice file.

License: MIT · 000

Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language: Python · License: NOASSERTION · 020

Realtime_AudioDenoise_EchoCancellation

Language: C++ · License: NOASSERTION · 010

ShadowsocksBio

A record of the history of SS, from its origins to the present, along with a brief tutorial summary.

License: CC-BY-SA-4.0 · 010

SpecAugment

An implementation of SpecAugment, introduced by Google Brain, with TensorFlow & PyTorch

Language: Python · License: Apache-2.0 · 020

spleeter

Deezer source separation library including pretrained models.

Language: Python · License: MIT · 010

Spleeter_Android_iOS

Spleeter (audio separation) NN models for Android / iOS apps

020

tacotron2

Multispeaker & Emotional TTS based on Tacotron 2 and WaveGlow

Language: Jupyter Notebook · License: BSD-3-Clause · 010

videoprocess

CN-Celeb, a large-scale Chinese celebrities dataset published by Center for Speech and Language Technology (CSLT) at Tsinghua University.

Language: Python · 010

wechat-chatgpt

Language: TypeScript · 010

zhrtvc

A Chinese voice cloning and speech synthesis system. Zhongwen real-time voice cloning and Chinese TTS.

Language: Python · 020