Beast code in Giters

Sangwook Han's repositories

Y-vector

Y-vector: Multiscale Waveform Encoder for Speaker Embedding

100

Res2Net-PretrainedModels

(ImageNet pretrained models) The official pytorch implemention of the TPAMI paper "Res2Net: A New Multi-scale Backbone Architecture"

000

byol-pytorch

Usable Implementation of "Bootstrap Your Own Latent" self-supervised learning, from Deepmind, in Pytorch

MIT000

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

MIT000

ECANet

Code for ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks

MIT000

torch-plda

PyTorch implementation of PLDA as described in https://ravisoji.com/assets/papers/ioffe2006probabilistic.pdf

MIT000

pytorch_xvectors

Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196

MIT100

meta-SR

Pytorch implementation of Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs (Interspeech, 2020)

000

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, speaker embedding

MIT000

meta-tasnet

A PyTorch implementation of Meta-TasNet from "Meta-learning Extractors for Music Source Separation

MIT000

pytorch-gradual-warmup-lr

Gradually-Warmup Learning Rate Scheduler for PyTorch

MIT000

the-incredible-pytorch

The Incredible PyTorch: a curated list of tutorials, papers, projects, communities and more relating to PyTorch.

MIT000

pytorch-loss

label-smooth, amsoftmax, focal-loss, triplet-loss. Maybe useful

MIT000

voxceleb_trainer

In defence of metric learning for speaker recognition

MIT000

Dual-Path-RNN-Pytorch

Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation implemented by Pytorch

000

Conv-TasNet-3

Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation Pytorch's Implement

000

keras-tcn

Keras Temporal Convolutional Network.

MIT000

Conference-Acceptance-Rate

Statistics of acceptance rate for the main AI conference

000

RawNet

Reproducing RawNet paper with Keras and additional experiments with PyTorch.

000

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

000

attention-is-all-you-need-pytorch

A PyTorch implementation of the Transformer model in "Attention is All You Need".

MIT000

Speech-Transformer

A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.

000

Conv-TasNet-1

A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).

MIT000

keras-attention-mechanism

Attention mechanism Implementation for Keras.

Apache-2.0000

TDNN-1

Time delay neural network (TDNN) implementation in Pytorch using unfold method

000

Factorized-TDNN

PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi

MIT100

pytorch-kaldi-neural-speaker-embeddings

A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.

BSD-3-Clause000

swhan9873

Sangwook Han's repositories

Y-vector

ECAPA-TDNN

Res2Net-PretrainedModels

byol-pytorch

fairseq

ECANet

torch-plda

pytorch_xvectors

meta-SR

pyannote-audio

meta-tasnet

data-driven-harmonic-filters

pase

pytorch-gradual-warmup-lr

the-incredible-pytorch

pytorch-loss

voxceleb_trainer

Dual-Path-RNN-Pytorch

Conv-TasNet-3

keras-tcn

Conference-Acceptance-Rate

RawNet

pytorch-kaldi

attention-is-all-you-need-pytorch

Speech-Transformer

Conv-TasNet-1

keras-attention-mechanism

TDNN-1

Factorized-TDNN

pytorch-kaldi-neural-speaker-embeddings