adiyoss

followers

following

stars

HUJI & FAIR

Tel Aviv

https://www.cs.huji.ac.il/~adiyoss/

Yossi Adi's repositories

WatermarkNN

Watermarking Deep Neural Networks (USENIX 2018)

Language:PythonMIT88 3 2

GCommandsPytorch

ConvNets for Audio Recognition using Google Commands Dataset

Language:Python70 4 2

DeepAnomaly

Recurrent Neural Networks for Anomaly Detection using Time Series Data

Language:PythonMIT21 60

DeepSegmentor

Sequence Segmentation using Joint RNN and Structured Prediction Models (ICASSP 2017)

Language:PythonMIT17 50

AutoVowelDuration

Automatic Measurement of Vowel Duration for Consonant Vowel Consonant (CVC) sound files (JASA 2016)

Language:PythonMIT14 70

StructED

Risk Minimization Algorithms in Structured Prediction (JMLR 2016)

Language:JavaNOASSERTION13 5 1

Chroma

Pitch and chroma implementation in java

Language:JavaNOASSERTION7 20

DeepVOT

Automatic Measurement of Voice Onset Time (VOT) using Deep Recurrent Neural Networks (Interspeech 2016)

Language:LuaMIT6 3 3

InDepth-Analysis

Sentence Representation Analysis

Language:PythonMIT2 20

colman_ml

ML course @ colman

Language:Jupyter Notebook1 20

DeepWDM

Recurrent Neural Networks for Word Duration Measurement

Language:PythonMIT1 2 1

denoiser

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.

Language:PythonNOASSERTION1 10

diffq

DiffQ performs differentiable quantization using pseudo quantization noise. It can automatically tune the number of bits used per weight or group of weight, in order to achieve a given trade-off between the model size and accuracy.

Language:PythonNOASSERTION100

Expresso

Expresso dataset demo page

Language:HTML1 10

Tools-to-Design-or-Visualize-Architecture-of-Neural-Network

Tools to Design or Visualize Architecture of Neural Network

1 10

adiyoss.github.io

Personal website

Language:HTMLMIT010

audio-cont

Language:HTML010

dataset

NOASSERTION020

dotfiles

dotfiles for vim, tmux, etc.

Language:Shell010

dsVAE-NES

010

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonMIT000

griffin_lim

Implementation of the Griffin and Lim algorithm to recover an audio signal from a magnitude-only spectrogram.

Language:PythonBSD-3-Clause020

iTerm2-Color-Schemes

Over 150 terminal color schemes/themes for iTerm/iTerm2 (with ports to Terminal, Konsole, PuTTY, Xresources, XRDB, and Terminator)

020

nltk_contrib

NLTK Contrib

Language:PythonNOASSERTION010

OpenNMT

Open-Source Neural Machine Translation in Torch

Language:LuaMIT030

py-webrtcvad

Python interface to the WebRTC Voice Activity Detector

Language:CNOASSERTION010

pytorch-stft

An STFT/iSTFT for PyTorch.

Language:PythonBSD-3-Clause020

StarGAN

PyTorch Implementation of StarGAN - CVPR 2018

Language:PythonMIT030

turk

020

wav2letter

Facebook AI Research's Automatic Speech Recognition Toolkit

Language:C++NOASSERTION020