Sangwook Han's repositories

Y-vector

Y-vector: Multiscale Waveform Encoder for Speaker Embedding

Stargazers: 1 | Issues: 0

Res2Net-PretrainedModels

(ImageNet pretrained models) The official PyTorch implementation of the TPAMI paper "Res2Net: A New Multi-scale Backbone Architecture"

Stargazers: 0 | Issues: 0

byol-pytorch

Usable implementation of "Bootstrap Your Own Latent" self-supervised learning, from DeepMind, in PyTorch

License: MIT | Stargazers: 0 | Issues: 0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

License: MIT | Stargazers: 0 | Issues: 0

ECANet

Code for ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks

License: MIT | Stargazers: 0 | Issues: 0

torch-plda

PyTorch implementation of PLDA as described in https://ravisoji.com/assets/papers/ioffe2006probabilistic.pdf

License: MIT | Stargazers: 0 | Issues: 0

pytorch_xvectors

Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196

License: MIT | Stargazers: 1 | Issues: 0

meta-SR

PyTorch implementation of Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs (Interspeech, 2020)

Stargazers: 0 | Issues: 0

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, speaker embedding

License: MIT | Stargazers: 0 | Issues: 0

meta-tasnet

A PyTorch implementation of Meta-TasNet from "Meta-learning Extractors for Music Source Separation"

License: MIT | Stargazers: 0 | Issues: 0

pase

Problem Agnostic Speech Encoder

License: MIT | Stargazers: 0 | Issues: 0

pytorch-gradual-warmup-lr

Gradually-Warmup Learning Rate Scheduler for PyTorch

License: MIT | Stargazers: 0 | Issues: 0

the-incredible-pytorch

The Incredible PyTorch: a curated list of tutorials, papers, projects, communities and more relating to PyTorch.

License: MIT | Stargazers: 0 | Issues: 0

pytorch-loss

Label smoothing, AM-Softmax, focal loss, and triplet loss. May be useful.

License: MIT | Stargazers: 0 | Issues: 0

voxceleb_trainer

In defence of metric learning for speaker recognition

License: MIT | Stargazers: 0 | Issues: 0

Dual-Path-RNN-Pytorch

Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation, implemented in PyTorch

Stargazers: 0 | Issues: 0

Conv-TasNet-3

PyTorch implementation of "Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation"

Stargazers: 0 | Issues: 0

keras-tcn

Keras Temporal Convolutional Network.

License: MIT | Stargazers: 0 | Issues: 0

Conference-Acceptance-Rate

Acceptance rate statistics for the main AI conferences

Stargazers: 0 | Issues: 0

RawNet

Reproducing RawNet paper with Keras and additional experiments with PyTorch.

Stargazers: 0 | Issues: 0

pytorch-kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by PyTorch, while feature extraction, label computation, and decoding are performed with the Kaldi toolkit.

Stargazers: 0 | Issues: 0

attention-is-all-you-need-pytorch

A PyTorch implementation of the Transformer model in "Attention is All You Need".

License: MIT | Stargazers: 0 | Issues: 0

Speech-Transformer

A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.

Stargazers: 0 | Issues: 0

Conv-TasNet-1

A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).

License: MIT | Stargazers: 0 | Issues: 0

keras-attention-mechanism

Attention mechanism implementation for Keras.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

TDNN-1

Time delay neural network (TDNN) implementation in PyTorch using the unfold method

Stargazers: 0 | Issues: 0

Factorized-TDNN

PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi

License: MIT | Stargazers: 1 | Issues: 0

pytorch-kaldi-neural-speaker-embeddings

A lightweight neural speaker embedding extractor based on Kaldi and PyTorch.

License: BSD-3-Clause | Stargazers: 0 | Issues: 0