kugatsu-sudo's starred repositories
free-spoken-digit-dataset
A free audio dataset of spoken digits. An audio version of MNIST.
light-reid
[ECCV2020] a toolbox of light-reid learning for faster inference, speed both feature extraction and retrieval stages up to >30x
leaf-audio
LEAF is a learnable alternative to audio features such as mel-filterbanks, that can be initialized as an approximation of mel-filterbanks, and then be trained for the task at hand, while using a very small number of parameters.
Relation-Aware-Global-Attention-Networks
We design an effective Relation-Aware Global Attention (RGA) module for CNNs to globally infer the attention.
end2end-asr-pytorch
End-to-End Automatic Speech Recognition on PyTorch
leaf-audio-pytorch
Pytorch port of Google Research's LEAF Audio paper
hypergraph_reid
Code for CVPR 2020 paper Learning Multi-Granular Hypergraphs for Video-Based Person Re-Identification
Image_Filter
Commonly used image filters. :earth_americas: 包罗常见的图像滤波器。
IIR-filter
An IIR filter class implementation in Python
snn-for-asr
Pytorch-Kaldi implementation of SNN-based ASR systems
snn_speechrec
Convolutional Spiking Neural Network to recognize speech utterances using Spike-Timing-Dependent Plasticity
BandpassGeffesAlgorithm
Python program for bandpass filter design
SpeechSpeakerRecognition
Repository for course in Speech and Speaker Recognition lab tasks
DigitsSpeech
Speech recognition model for digits from 0 to 9.