Wenwan Chen (WenwanChen)

WenwanChen

Geek Repo

Company:Rice University

Location:Houston

Github PK Tool:Github PK Tool

Wenwan Chen's repositories

02456-deep-learning-with-PyTorch

Exercises and supplementary material for the deep learning course 02456 using PyTorch.

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

A-Convolutional-Recurrent-Neural-Network-for-Real-Time-Speech-Enhancement

A minimum unofficial implementation of the A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement (CRN) using PyTorch.

Language:PythonStargazers:0Issues:1Issues:0

AM-MobileNet1D

The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 architecture and the Additive Margin Softmax (AM-Softmax) loss function.)

Language:PythonStargazers:0Issues:0Issues:0

asteroid

The PyTorch-based audio source separation toolkit for researchers || Pretrained models available

License:MITStargazers:0Issues:0Issues:0

asv-subtools

An Open Source Tools for Speaker Recognition

License:Apache-2.0Stargazers:0Issues:0Issues:0

awesome-mental-health

A curated list of awesome articles, websites and resources about mental health in the software industry.

License:CC0-1.0Stargazers:0Issues:0Issues:0

Awesome_ML_for_mental_health

A curated list of awesome work on machine learning for mental health applications. Includes topics broadly captured by affective computing. Facial expressions, speech analysis, emotion prediction, depression, interactions, psychiatry etc. etc.

Stargazers:0Issues:0Issues:0

crnn-audio-classification

UrbanSound classification using Convolutional Recurrent Networks in PyTorch

License:MITStargazers:0Issues:0Issues:0

data-augmentation-review

List of useful data augmentation resources. You will find here some not common techniques, libraries, links to github repos, papers and others.

Stargazers:0Issues:0Issues:0

E2E-NPLDA

End-To-End Speaker Verification based on X-vector and Neural PLDA - A PyTorch implementation

Stargazers:0Issues:0Issues:0

eng-practices

Google's Engineering Practices documentation

License:NOASSERTIONStargazers:0Issues:0Issues:0

ignite

High-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently.

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

inaSpeechSegmenter

CNN-based audio segmentation toolkit. Allows to detect speech, music and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.

License:MITStargazers:0Issues:0Issues:0

kaldiio

A pure python module for reading and writing kaldi ark files

License:NOASSERTIONStargazers:0Issues:0Issues:0

keras-sincnet

Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)

Stargazers:0Issues:0Issues:0

kws

An End-to-End Architecture for Keyword Spotting and Voice Activity Detection

License:MITStargazers:0Issues:0Issues:0

libriadapt

Instructions on downloading and using the LibriAdapt dataset

Stargazers:0Issues:0Issues:0

mental-health-datasets

An evolving list of electronic media data sets used to model mental-health status.

Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

MS-SNSD

The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number of speakers, noise types, and Speech to Noise Ratio (SNR) levels desired.

License:MITStargazers:0Issues:0Issues:0

myprosody

A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to ones of native speech.

License:MITStargazers:0Issues:0Issues:0

nnAudio

Audio processing by using pytorch 1D convolution network

License:MITStargazers:0Issues:0Issues:0

Parselmouth

Praat in Python, the Pythonic way

License:GPL-3.0Stargazers:0Issues:0Issues:0

py-webrtcvad

Python interface to the WebRTC Voice Activity Detector

License:NOASSERTIONStargazers:0Issues:0Issues:0

tensorflow

An Open Source Machine Learning Framework for Everyone

License:Apache-2.0Stargazers:0Issues:0Issues:0

VAD-python

Voice Activity Detector in Python

Stargazers:0Issues:0Issues:0

Voice-Privacy-Challenge-2020

Baseline Recipe for VoicePrivacy Challenge 2020: https://www.voiceprivacychallenge.org/docs/VoicePrivacy_2020_Eval_Plan_v1_3.pdf

Stargazers:0Issues:0Issues:0

voice_datasets

🔊 A comprehensive list of open-source datasets for voice and sound computing (40+ datasets).

Stargazers:0Issues:0Issues:0

youtube-dl

Command-line program to download videos from YouTube.com and other video sites

License:UnlicenseStargazers:0Issues:0Issues:0

Zoom-Automation-Python

This project sign into your zoom meetings / classes on time automatically for you.

Stargazers:0Issues:0Issues:0