Vinay Kothapally's repositories
Complex-valued-Attention
Transformer based Self-Attention for Complex Numbers
Complex-valued-DNN-Speech-Enhancement
Complex valued Deep Neural Network for Speech Enhancement
Complex-valued-GRU-PyTorch
Gated Recurrent Neural Networks for Complex Numbers
Awesome-Speech-Enhancement
A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.
awesome-speech-enhancement-1
speech enhancement\speech seperation\sound source localization
Complex-valued-Deformable-Convolutions
Deformable Convolutions for Complex Numbers
Machine-Learning-Collection
A resource for learning about ML, DL, PyTorch and TensorFlow. Feedback always appreciated :)
Adaptive-deformable-convolution
Pytorch-based adaptive deformable convolution
EfficientDNNs
Collection of recent methods on (deep) neural network compression and acceleration.
Neural-Speech-Dereverberation
Machine and Deep Learning models for speech dereverberation
scientific-visualization-book
An open access book on scientific visualization using python and matplotlib
TensorLayer
Deep Learning and Reinforcement Learning Library for Scientists and Engineers 🔥
transformer
Implementation of "Attention Is All You Need" using pytorch
UniTrack
[NeurIPS'21] Unified tracking framework with a single appearance model. It supports Single Object Tracking (SOT), Video Object Segmentation (VOS), Multi-Object Tracking (MOT), Multi-Object Tracking and Segmentation (MOTS), Pose Tracking, Video Instance Segmentation (VIS), and class-agnostic MOT (e.g. TAO dataset).
ASH-IR-Dataset
An impulse response dataset for binaural synthesis of spatial audio systems on headphones
audio-development-tools
This is a list of sound, audio and music development tools which contains machine learning, audio generation, audio signal processing, sound synthesis, spatial audio, music information retrieval, music generation, speech recognition, speech synthesis, singing voice synthesis and more.
beamformers
Easy to use Beamformers for multi-channel speech separation/enhancement
MLfAS
Machine Learning for Audio Signals in Python
SpeechT5
Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
torch-audiomentations
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
torchsubband
Pytorch implementation of subband decomposition