KiAlexander's starred repositories

google-research

Google Research

Language: Jupyter Notebook | Apache-2.0 | 33143 749 1191

tinygrad

You like pytorch? You like micrograd? You love tinygrad! ❤️

Language: Python | MIT | 24533 265 619

speechbrain

A PyTorch-based Speech Toolkit

Language: Python | Apache-2.0 | 8045 128 1034

lip-reading-deeplearning

Lip Reading - Cross Audio-Visual Recognition using 3D Architectures

Language: Python | Apache-2.0 | 1813 55 38

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language: Python | MIT | 1790 32 159

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model working on the raw waveform that runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise, including stationary and non-stationary noise, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly to the raw waveform which further improve model performance and its generalization abilities.

Language: Python | NOASSERTION | 1579 37 149

packnet-sfm

TRI-ML Monocular Depth Estimation Repository

Language: Python | MIT | 1199 56 228

k2

FSA/FST algorithms, differentiable, with PyTorch compatibility.

Language: Cuda | Apache-2.0 | 1058 77 369

Res2Net-PretrainedModels

(ImageNet pretrained models) The official PyTorch implementation of the TPAMI paper "Res2Net: A New Multi-scale Backbone Architecture"

Language: Python | 1046 27 74

audino

Open source audio annotation tool for humans

Language: JavaScript | MIT | 1027 24 56

lhotse

Tools for handling speech data in machine learning projects.

Language: Python | Apache-2.0 | 877 44 392

transformer

Implementation of the Transformer model (originally from "Attention is All You Need") applied to time series.

Language: Jupyter Notebook | GPL-3.0 | 819 15 58

Speech-Transformer

A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.

Language: Python | 766 30 40

open-aff

Code and trained models for "Attentional Feature Fusion"

Language: Python | 683 8 42

transformer

A PyTorch implementation of "Attention is All You Need" and "Weighted Transformer Network for Machine Translation"

Language: Python | 534 7 17

Lipreading_using_Temporal_Convolutional_Networks

ICASSP'22 Training Strategies for Improved Lip-Reading; ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASSP'20 Lipreading using Temporal Convolutional Networks

Language: Python | NOASSERTION | 368 9 63

pika

A lightweight speech processing toolkit based on PyTorch and (Py)Kaldi

Language: Python | Apache-2.0 | 338 14 11

pystoi

Python implementation of the Short Term Objective Intelligibility measure

Language: MATLAB | MIT | 310 12 19

torchsummaryX

torchsummaryX: Improved visualization tool of torchsummary

Language: Python | 300 2 21

MMAL-Net

This is a PyTorch implementation of the paper "Multi-branch and Multi-scale Attention Learning for Fine-Grained Visual Categorization (MMAL-Net)" (Fan Zhang, Meng Li, Guisheng Zhai, Yizhao Liu).

Language: Python | 247 5 44