Takehiko's repositories
conv-tas-net
A PyTorch implementation of "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation"
Conv-TasNet
A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).
recommended-books
Recommended classic computer science books; PDF downloads are available for some titles.
segan-pytorch
SEGAN PyTorch implementation (https://arxiv.org/abs/1703.09452)
Speech-enhancement
Deep neural network based speech enhancement toolkit
Adaptive_front_ends-1
Adaptive front ends
asr-evaluation
Python module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).
Audio-Classification-using-CNN-MLP
Multi-class audio classification using deep learning (MLP, CNN). The objective of this project is to build a multi-class classifier that identifies the sound of a bee, a cricket, or noise.
Audio_Classification_using_LSTM
Classification of the Urban Sound audio dataset using an LSTM-based model.
audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
Dysphonia-Detection-Using-Gaussian-Mixture-Models
Detecting dysphonia (a voice disorder) using i-vector feature extraction and modelling the extracted features with Gaussian Mixture Models.
huobi-autotrading
Automated trading tool for the Huobi exchange
my-voice-analysis
My-Voice Analysis is a Python library for the analysis of voice (simultaneous speech, high entropy) without the need of a transcription. It breaks utterances and detects syllable boundaries, fundamental frequency contours, and formants.
Python-Wrapper-for-World-Vocoder
A Python wrapper for the high-quality vocoder "World"
pytorch-book
PyTorch tutorials and fun projects including neural talk, neural style, poem writing, anime generation
pytorch_xvectors
Deep speaker embeddings in PyTorch, including x-vectors
SciencePlots
Format Matplotlib for scientific plotting
segan_pytorch
Speech Enhancement Generative Adversarial Network in PyTorch
SoundRecognitionTCN
Sound Recognition Pipeline using Temporal Convolutional Networks
speech-gender-detection
Gender detection from a speech audio file
tensorflow-wavenet
A TensorFlow implementation of DeepMind's WaveNet paper
transferlearning
Everything about Transfer Learning and Domain Adaptation
Urban-Sound-Classification-VS
Urban Sound Classification with TensorFlow Keras - MLP, RNN, CNN
VoiceGenderRecognition
A simple Python project that handles voice recognition. It recognizes the speaker's gender.
Wave-U-Net
Implementation of the Wave-U-Net for audio source separation