雲夢's repositories
waveglow_vocoder
A vocoder that converts audio to a mel-spectrogram and back using WaveGlow, with GPU support.
midi_merge
Merge two MIDI files into a single file with two tracks.
aishell-3-baseline-fc-1
Code for the AISHELL-3 baseline acoustic model.
AutoSpeech
The 1st place solution for AutoSpeech 2019.
autospeech19
The 3rd place solution for AutoSpeech 2019.
AutoSpeech2019
Solution for AutoSpeech Challenge 2019
DCASE2020-Task1
Jupyter notebook for DCASE 2020 challenge Task 1
DCASE2020_task1
Code for DCASE 2020 task 1a and task 1b.
Gender-Classification
Gender Classification of Speech Signals
Markdown-Resume-Template
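A résumé template shared by a BAT programmer. A technical résumé should be simple and clear, without unnecessary decoration. Fork it to your own repository and modify it from this template.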
NLNL-Negative-Learning-for-Noisy-Labels
NLNL: Negative Learning for Noisy Labels
OpenTransformer
A No-Recurrence Sequence-to-Sequence Model for Speech Recognition
pase
Problem Agnostic Speech Encoder
pitch_jitter_shimmer
Use Praat to extract pitch, jitter, and shimmer parameters from a voice file.
Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
ShadowsocksBio
Notes on the history of SS, along with a brief tutorial summary.
SpecAugment
An implementation of SpecAugment, introduced by Google Brain, in TensorFlow & PyTorch
Spleeter_Android_iOS
Spleeter (audio separation) NN models for Android / iOS apps
videoprocess
CN-Celeb, a large-scale Chinese celebrities dataset published by Center for Speech and Language Technology (CSLT) at Tsinghua University.