Beast code in Giters

xuanjihe's repositories

speech-emotion-recognition

speech emotion recognition using a convolutional recurrent networks based on IEMOCAP

Language:Python388 13 43

cmu-thesis

Code for Yun Wang's PhD Thesis: Polyphonic Sound Event Detection with Weak Labeling

Language:PythonMIT1 10

Speech-Emotion-Analyzer

The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)

Language:Jupyter NotebookMIT1 10

tf-kaldi-speaker

Neural speaker recognition/verification system based on Kaldi and Tensorflow

Language:PythonApache-2.01 10

wav2letter

Facebook AI Research Automatic Speech Recognition Toolkit

Language:C++NOASSERTION1 10

AIR-ASVspoof

Implementation of the paper "One-class Learning towards Generalized Voice Spoofing Detection"

Language:PythonMIT000

ASSERT

JHU's system submission to the ASVspoof 2019 Challenge: Anti-Spoofing with Squeeze-Excitation and Residual neTworks (ASSERT).

Language:MATLABMIT010

Auto-Tuning-Spectral-Clustering

This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"

Language:PythonMIT010

awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

Apache-2.0010

CircleLoss

Pytorch implementation of the paper "Circle Loss: A Unified Perspective of Pair Similarity Optimization"

000

Dcase2018_pooling

Repo for our pooling approach on the DCASE2018 task4

Apache-2.0000

deep-voice-conversion

Deep neural networks for voice conversion (voice style transfer) in Tensorflow

Language:PythonMIT010

ECAPA-TDNN

000

Factorized-TDNN

PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi

Language:PythonMIT010

GradientReversal

Gradient Reversal Layer for Domain Adaptation

Language:Python000

kaldi

This is now the official location of the Kaldi project.

Language:ShellNOASSERTION000

MomentumContrast.pytorch

Reproduction of Momentum Contrast for Unsupervised Visual Representation Learning

Language:PythonMIT010

prefetch_generator

Simple package that makes your generator work in background thread

NOASSERTION000

pyAudioAnalysis

Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications

Language:PythonApache-2.0010

pytorch_xvectors

Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196

Language:PythonMIT010

QAMFace

Pytorch implementation of Quadratic Additive Angular Margin Loss for Face Recognition

Language:PythonMIT010

speaker_embedding_moco

Language:PythonNOASSERTION010

Speaker_Verification

Tensorflow implementation of generalized end-to-end loss for speaker verification

Language:Python010

spec_augment

🔦 A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition

MIT000

SpectralCluster

Python re-implementation of the spectral clustering algorithm in the paper "Speaker Diarization with LSTM"

Language:PythonApache-2.0010

Speech_emotion_recognition_BLSTM

Bidirectional LSTM network for speech emotion recognition.

Language:PythonMIT010

SphereFace

This is a MNIST Implementation for <SphereFace: Deep Hypersphere Embedding for Face Recognition> in CVPR'17.

Language:PythonMIT010

tensorflow-triplet-loss

Implementation of triplet loss in TensorFlow

Language:PythonMIT010

uis-rnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

Apache-2.0000

VBx

Variational Bayes HMM over x-vectors diarization

000