xuanjihe

xuanjihe

Geek Repo

Github PK Tool:Github PK Tool

xuanjihe's repositories

speech-emotion-recognition

speech emotion recognition using a convolutional recurrent networks based on IEMOCAP

cmu-thesis

Code for Yun Wang's PhD Thesis: Polyphonic Sound Event Detection with Weak Labeling

Language:PythonLicense:MITStargazers:1Issues:1Issues:0

Speech-Emotion-Analyzer

The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)

Language:Jupyter NotebookLicense:MITStargazers:1Issues:1Issues:0

tf-kaldi-speaker

Neural speaker recognition/verification system based on Kaldi and Tensorflow

Language:PythonLicense:Apache-2.0Stargazers:1Issues:1Issues:0

wav2letter

Facebook AI Research Automatic Speech Recognition Toolkit

Language:C++License:NOASSERTIONStargazers:1Issues:1Issues:0

AIR-ASVspoof

Implementation of the paper "One-class Learning towards Generalized Voice Spoofing Detection"

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

ASSERT

JHU's system submission to the ASVspoof 2019 Challenge: Anti-Spoofing with Squeeze-Excitation and Residual neTworks (ASSERT).

Language:MATLABLicense:MITStargazers:0Issues:1Issues:0

Auto-Tuning-Spectral-Clustering

This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

License:Apache-2.0Stargazers:0Issues:1Issues:0

CircleLoss

Pytorch implementation of the paper "Circle Loss: A Unified Perspective of Pair Similarity Optimization"

Stargazers:0Issues:0Issues:0

Dcase2018_pooling

Repo for our pooling approach on the DCASE2018 task4

License:Apache-2.0Stargazers:0Issues:0Issues:0

deep-voice-conversion

Deep neural networks for voice conversion (voice style transfer) in Tensorflow

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
Stargazers:0Issues:0Issues:0

Factorized-TDNN

PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

GradientReversal

Gradient Reversal Layer for Domain Adaptation

Language:PythonStargazers:0Issues:0Issues:0

kaldi

This is now the official location of the Kaldi project.

Language:ShellLicense:NOASSERTIONStargazers:0Issues:0Issues:0

MomentumContrast.pytorch

Reproduction of Momentum Contrast for Unsupervised Visual Representation Learning

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

prefetch_generator

Simple package that makes your generator work in background thread

License:NOASSERTIONStargazers:0Issues:0Issues:0

pyAudioAnalysis

Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

pytorch_xvectors

Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

QAMFace

Pytorch implementation of Quadratic Additive Angular Margin Loss for Face Recognition

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

Speaker_Verification

Tensorflow implementation of generalized end-to-end loss for speaker verification

Language:PythonStargazers:0Issues:1Issues:0

spec_augment

🔦 A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition

License:MITStargazers:0Issues:0Issues:0

SpectralCluster

Python re-implementation of the spectral clustering algorithm in the paper "Speaker Diarization with LSTM"

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

Speech_emotion_recognition_BLSTM

Bidirectional LSTM network for speech emotion recognition.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

SphereFace

This is a MNIST Implementation for <SphereFace: Deep Hypersphere Embedding for Face Recognition> in CVPR'17.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

tensorflow-triplet-loss

Implementation of triplet loss in TensorFlow

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

uis-rnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

License:Apache-2.0Stargazers:0Issues:0Issues:0

VBx

Variational Bayes HMM over x-vectors diarization

Stargazers:0Issues:0Issues:0