Beast code in Giters

Wenwan Chen's repositories

02456-deep-learning-with-PyTorch

Exercises and supplementary material for the deep learning course 02456 using PyTorch.

Language:Jupyter Notebook000

A-Convolutional-Recurrent-Neural-Network-for-Real-Time-Speech-Enhancement

A minimum unofficial implementation of the A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement (CRN) using PyTorch.

Language:Python010

AM-MobileNet1D

The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 architecture and the Additive Margin Softmax (AM-Softmax) loss function.)

Language:Python000

asteroid

The PyTorch-based audio source separation toolkit for researchers || Pretrained models available

MIT000

asv-subtools

An Open Source Tools for Speaker Recognition

Apache-2.0000

awesome-mental-health

A curated list of awesome articles, websites and resources about mental health in the software industry.

CC0-1.0000

Awesome_ML_for_mental_health

A curated list of awesome work on machine learning for mental health applications. Includes topics broadly captured by affective computing. Facial expressions, speech analysis, emotion prediction, depression, interactions, psychiatry etc. etc.

000

crnn-audio-classification

UrbanSound classification using Convolutional Recurrent Networks in PyTorch

MIT000

data-augmentation-review

List of useful data augmentation resources. You will find here some not common techniques, libraries, links to github repos, papers and others.

000

E2E-NPLDA

End-To-End Speaker Verification based on X-vector and Neural PLDA - A PyTorch implementation

000

eng-practices

Google's Engineering Practices documentation

NOASSERTION000

ignite

High-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently.

BSD-3-Clause000

inaSpeechSegmenter

CNN-based audio segmentation toolkit. Allows to detect speech, music and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.

MIT000

kaldiio

A pure python module for reading and writing kaldi ark files

NOASSERTION000

keras-sincnet

Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)

000

kws

An End-to-End Architecture for Keyword Spotting and Voice Activity Detection

MIT000

libriadapt

Instructions on downloading and using the LibriAdapt dataset

000

mental-health-datasets

An evolving list of electronic media data sets used to model mental-health status.

000

MMSE-Prediction

000

MS-SNSD

The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number of speakers, noise types, and Speech to Noise Ratio (SNR) levels desired.

MIT000