Vinay Kothapally (vkothapally)

vkothapally

Geek Repo

Company:Center for Robust Speech Systems (CRSS)

Location:Dallas, TX

Github PK Tool:Github PK Tool

Vinay Kothapally's repositories

Language:HTMLLicense:CC0-1.0Stargazers:31Issues:2Issues:0

Complex-valued-Attention

Transformer based Self-Attention for Complex Numbers

Language:PythonLicense:Apache-2.0Stargazers:10Issues:1Issues:0

Complex-valued-DNN-Speech-Enhancement

Complex valued Deep Neural Network for Speech Enhancement

License:Apache-2.0Stargazers:3Issues:1Issues:0

Complex-valued-GRU-PyTorch

Gated Recurrent Neural Networks for Complex Numbers

Awesome-Speech-Enhancement

A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.

Language:MATLABLicense:MITStargazers:2Issues:0Issues:0

awesome-speech-enhancement-1

speech enhancement\speech seperation\sound source localization

License:GPL-2.0Stargazers:2Issues:0Issues:0

Complex-valued-Deformable-Convolutions

Deformable Convolutions for Complex Numbers

Language:PythonLicense:Apache-2.0Stargazers:2Issues:1Issues:0

Machine-Learning-Collection

A resource for learning about ML, DL, PyTorch and TensorFlow. Feedback always appreciated :)

Language:PythonLicense:MITStargazers:2Issues:0Issues:0

pysepm

Python implementation of performance metrics in Loizou's Speech Enhancement book

Language:PythonLicense:GPL-3.0Stargazers:2Issues:0Issues:0

sru

Training RNNs as Fast as CNNs (https://arxiv.org/abs/1709.02755)

Language:PythonLicense:MITStargazers:2Issues:0Issues:0

TCN

Sequence modeling benchmarks and temporal convolutional networks

Language:PythonLicense:MITStargazers:2Issues:0Issues:0

Adaptive-deformable-convolution

Pytorch-based adaptive deformable convolution

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

auraloss

Collection of audio-focused loss functions in PyTorch

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

EfficientDNNs

Collection of recent methods on (deep) neural network compression and acceleration.

License:MITStargazers:1Issues:0Issues:0
Language:PythonStargazers:1Issues:0Issues:0

Neural-Speech-Dereverberation

Machine and Deep Learning models for speech dereverberation

Language:PythonLicense:GPL-3.0Stargazers:1Issues:0Issues:0

scientific-visualization-book

An open access book on scientific visualization using python and matplotlib

Language:PythonLicense:NOASSERTIONStargazers:1Issues:0Issues:0

StyleSwin

StyleSwin: Transformer-based GAN for High-resolution Image Generation

Stargazers:1Issues:0Issues:0

TensorLayer

Deep Learning and Reinforcement Learning Library for Scientists and Engineers 🔥

Language:PythonLicense:NOASSERTIONStargazers:1Issues:0Issues:0

transformer

Implementation of "Attention Is All You Need" using pytorch

Language:PythonStargazers:1Issues:0Issues:0

UniTrack

[NeurIPS'21] Unified tracking framework with a single appearance model. It supports Single Object Tracking (SOT), Video Object Segmentation (VOS), Multi-Object Tracking (MOT), Multi-Object Tracking and Segmentation (MOTS), Pose Tracking, Video Instance Segmentation (VIS), and class-agnostic MOT (e.g. TAO dataset).

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

ASH-IR-Dataset

An impulse response dataset for binaural synthesis of spatial audio systems on headphones

License:NOASSERTIONStargazers:0Issues:0Issues:0

audio-development-tools

This is a list of sound, audio and music development tools which contains machine learning, audio generation, audio signal processing, sound synthesis, spatial audio, music information retrieval, music generation, speech recognition, speech synthesis, singing voice synthesis and more.

License:MITStargazers:0Issues:0Issues:0

beamformers

Easy to use Beamformers for multi-channel speech separation/enhancement

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

MLfAS

Machine Learning for Audio Signals in Python

Language:Jupyter NotebookStargazers:0Issues:0Issues:0
License:NOASSERTIONStargazers:0Issues:0Issues:0

SpeechT5

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

torch-audiomentations

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

torchsubband

Pytorch implementation of subband decomposition

Language:HTMLLicense:MITStargazers:0Issues:0Issues:0