Wenwan Chen's repositories
02456-deep-learning-with-PyTorch
Exercises and supplementary material for the deep learning course 02456 using PyTorch.
A-Convolutional-Recurrent-Neural-Network-for-Real-Time-Speech-Enhancement
A minimum unofficial implementation of the A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement (CRN) using PyTorch.
AM-MobileNet1D
The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 architecture and the Additive Margin Softmax (AM-Softmax) loss function.)
asteroid
The PyTorch-based audio source separation toolkit for researchers || Pretrained models available
asv-subtools
An Open Source Tools for Speaker Recognition
awesome-mental-health
A curated list of awesome articles, websites and resources about mental health in the software industry.
Awesome_ML_for_mental_health
A curated list of awesome work on machine learning for mental health applications. Includes topics broadly captured by affective computing. Facial expressions, speech analysis, emotion prediction, depression, interactions, psychiatry etc. etc.
crnn-audio-classification
UrbanSound classification using Convolutional Recurrent Networks in PyTorch
data-augmentation-review
List of useful data augmentation resources. You will find here some not common techniques, libraries, links to github repos, papers and others.
E2E-NPLDA
End-To-End Speaker Verification based on X-vector and Neural PLDA - A PyTorch implementation
eng-practices
Google's Engineering Practices documentation
ignite
High-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently.
inaSpeechSegmenter
CNN-based audio segmentation toolkit. Allows to detect speech, music and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.
kaldiio
A pure python module for reading and writing kaldi ark files
keras-sincnet
Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
kws
An End-to-End Architecture for Keyword Spotting and Voice Activity Detection
libriadapt
Instructions on downloading and using the LibriAdapt dataset
mental-health-datasets
An evolving list of electronic media data sets used to model mental-health status.
MS-SNSD
The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number of speakers, noise types, and Speech to Noise Ratio (SNR) levels desired.
myprosody
A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to ones of native speech.
nnAudio
Audio processing by using pytorch 1D convolution network
Parselmouth
Praat in Python, the Pythonic way
py-webrtcvad
Python interface to the WebRTC Voice Activity Detector
tensorflow
An Open Source Machine Learning Framework for Everyone
VAD-python
Voice Activity Detector in Python
Voice-Privacy-Challenge-2020
Baseline Recipe for VoicePrivacy Challenge 2020: https://www.voiceprivacychallenge.org/docs/VoicePrivacy_2020_Eval_Plan_v1_3.pdf
voice_datasets
🔊 A comprehensive list of open-source datasets for voice and sound computing (40+ datasets).
youtube-dl
Command-line program to download videos from YouTube.com and other video sites
Zoom-Automation-Python
This project sign into your zoom meetings / classes on time automatically for you.