Wenwan Chen's repositories
SPN-Spk-Rec
Sum-Product Networks (SPNs) for Robust Automatic Speaker Recognition.
audiosetdl
Scripts for downloading AudioSet
awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
coursera-algorithms-part1
📖Coursera Princeton Algorithms Part 1
depression-detect
Predicting depression from acoustic features of speech using a Convolutional Neural Network.
depression-detection
Depression Detection Using Twitter Data - Group project for Udacity Private & Secure AI Project Showcase
kaldi-asr-aws
Code accompanying the Medium article on setting up Kaldi on AWS
kaldi-ivector
Extension to Kaldi implementing the standard i-vector hyperparameter estimation and i-vector extraction procedure
laughter
Learning embeddings for laughter categorization
models
Models and examples built with TensorFlow
music163-spiders
Crawler for song comments on NetEase Cloud Music
OQRanD-and-OQGenD
Code and data for the conference paper "Asking the Crowd: Question Analysis, Evaluation and Generation for Open Discussion on Online Forums", accepted at ACL'19.
pyAudioAnalysis
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
PyTorch_Speaker_Verification
PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.
reproducible-audio-research
List of Reproducible Audio Research Papers
saa_experiments
A set of experiments designed to reproduce selective auditory attention in computers.
Singing_Voice_Separation_RNN
Singing-Voice Separation From Monaural Recordings Using Deep Recurrent Neural Networks
speaker-embedding-with-phonetic-information
The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information"
spec_augment
🔦 A PyTorch implementation of Google Brain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
TensorFlow-Examples
TensorFlow Tutorial and Examples for Beginners with Latest APIs
tensorflow-speech-recognition
🎙 Speech recognition using the TensorFlow deep learning framework and sequence-to-sequence neural networks
TOWE
Code and data for "Target-oriented Opinion Words Extraction with Target-fused Neural Sequence Labeling" (NAACL2019)
uis-rnn
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper "Fully Supervised Speaker Diarization".
VBDiarization
Speaker diarization based on Kaldi x-vectors, tuned for 16 kHz microphone data
voice-activity-detection
Voice Activity Detection (VAD) using deep learning.
voice-vector
Deep neural networks, written in TensorFlow, for extracting text-independent speaker embeddings