vkothapally

Vinay Kothapally's repositories

Complex-valued-Attention

Transformer based Self-Attention for Complex Numbers

Language:PythonApache-2.010 10

Complex-valued-DNN-Speech-Enhancement

Complex valued Deep Neural Network for Speech Enhancement

Apache-2.03 10

Complex-valued-GRU-PyTorch

Gated Recurrent Neural Networks for Complex Numbers

Apache-2.03 1 1

Awesome-Speech-Enhancement

A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.

Language:MATLABMIT200

awesome-speech-enhancement-1

speech enhancement\speech seperation\sound source localization

GPL-2.0200

Complex-valued-Deformable-Convolutions

Deformable Convolutions for Complex Numbers

Language:PythonApache-2.02 10

Machine-Learning-Collection

A resource for learning about ML, DL, PyTorch and TensorFlow. Feedback always appreciated :)

Language:PythonMIT200

pysepm

Python implementation of performance metrics in Loizou's Speech Enhancement book

Language:PythonGPL-3.0200

sru

Training RNNs as Fast as CNNs (https://arxiv.org/abs/1709.02755)

Language:PythonMIT200

TCN

Sequence modeling benchmarks and temporal convolutional networks

Language:PythonMIT200

Adaptive-deformable-convolution

Pytorch-based adaptive deformable convolution

Language:PythonMIT100

auraloss

Collection of audio-focused loss functions in PyTorch

Language:PythonApache-2.0100

EfficientDNNs

Collection of recent methods on (deep) neural network compression and acceleration.

MIT100

Neural-Speech-Dereverberation

Machine and Deep Learning models for speech dereverberation

Language:PythonGPL-3.0100

scientific-visualization-book

An open access book on scientific visualization using python and matplotlib

Language:PythonNOASSERTION100

StyleSwin

StyleSwin: Transformer-based GAN for High-resolution Image Generation

100

TensorLayer

Deep Learning and Reinforcement Learning Library for Scientists and Engineers 🔥

Language:PythonNOASSERTION100

transformer

Implementation of "Attention Is All You Need" using pytorch

Language:Python100

[NeurIPS'21] Unified tracking framework with a single appearance model. It supports Single Object Tracking (SOT), Video Object Segmentation (VOS), Multi-Object Tracking (MOT), Multi-Object Tracking and Segmentation (MOTS), Pose Tracking, Video Instance Segmentation (VIS), and class-agnostic MOT (e.g. TAO dataset).

Language:PythonMIT100

ASH-IR-Dataset

An impulse response dataset for binaural synthesis of spatial audio systems on headphones

NOASSERTION000

audio-development-tools

This is a list of sound, audio and music development tools which contains machine learning, audio generation, audio signal processing, sound synthesis, spatial audio, music information retrieval, music generation, speech recognition, speech synthesis, singing voice synthesis and more.

MIT000

vkothapally

Vinay Kothapally's repositories

JAECBF

Subband-Beamformer

Complex-valued-Attention

Complex-valued-DNN-Speech-Enhancement

Complex-valued-GRU-PyTorch

Awesome-Speech-Enhancement

awesome-speech-enhancement-1

Complex-valued-Deformable-Convolutions

Machine-Learning-Collection

pysepm

sru

TCN

Adaptive-deformable-convolution

auraloss

EfficientDNNs

GCRN-complex

Neural-Speech-Dereverberation

scientific-visualization-book

StyleSwin

TensorLayer

transformer

UniTrack

ASH-IR-Dataset

audio-development-tools

beamformers

MLfAS

pytorch-speech-features

SpeechT5

torch-audiomentations

torchsubband