Josef's starred repositories

100-Days-Of-ML-Code

100 Days of ML Coding

License:MITStargazers:44340Issues:2443Issues:0

bert

TensorFlow code and pre-trained models for BERT

Language:PythonLicense:Apache-2.0Stargazers:37687Issues:998Issues:1142

google-research

Google Research

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:33653Issues:748Issues:1227

NLP-progress

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

Language:PythonLicense:MITStargazers:22483Issues:1266Issues:100

kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Language:ShellLicense:NOASSERTIONStargazers:14029Issues:696Issues:1641

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language:PythonLicense:Apache-2.0Stargazers:11277Issues:201Issues:2196

sonnet

TensorFlow-based neural network library

Language:PythonLicense:Apache-2.0Stargazers:9605Issues:423Issues:187

espnet

End-to-End Speech Processing Toolkit

Language:PythonLicense:Apache-2.0Stargazers:8218Issues:180Issues:2341

Data-Analysis

Data Science Using Python

Language:Jupyter NotebookLicense:MITStargazers:5154Issues:354Issues:61

uis-rnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

Language:PythonLicense:Apache-2.0Stargazers:1547Issues:102Issues:87

ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

Language:Jupyter NotebookLicense:MITStargazers:1531Issues:45Issues:254

UnsupervisedMT

Phrase-Based & Neural Unsupervised Machine Translation

Language:PythonLicense:NOASSERTIONStargazers:1507Issues:122Issues:101

deepops

Tools for building GPU clusters

Language:ShellLicense:BSD-3-ClauseStargazers:1239Issues:52Issues:429

loop

A method to generate speech across multiple speakers

Language:PythonLicense:NOASSERTIONStargazers:870Issues:68Issues:75

CTCDecoder

Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon search, prefix search, and token passing. Implemented in Python.

Language:PythonLicense:MITStargazers:813Issues:25Issues:23

knn-vc

Voice Conversion With Just Nearest Neighbors

Language:PythonLicense:NOASSERTIONStargazers:440Issues:14Issues:35

audio-annotator

A JavaScript interface for annotating and labeling audio files.

Language:JavaScriptLicense:BSD-2-ClauseStargazers:432Issues:17Issues:10

autoEdit_2

Fast text based video editing, node Electron Os X desktop app, with Backbone front end.

Language:JavaScriptLicense:MITStargazers:418Issues:39Issues:73

open-speech-recording

Web application to record speech for an open data set

Language:HTMLLicense:Apache-2.0Stargazers:417Issues:25Issues:7

DistanceGAN

Pytorch implementation of "One-Sided Unsupervised Domain Mapping" NIPS 2017

Language:PythonLicense:NOASSERTIONStargazers:195Issues:12Issues:4

vcc20_baseline_cyclevae

Voice Conversion Challenge 2020 CycleVAE baseline system

Language:PythonLicense:MITStargazers:132Issues:6Issues:9

word-embeddings-for-nmt

Supplementary material for "When and Why Are Pre-trained Word Embeddings Useful for Neural Machine Translation?" at NAACL 2018

FrameTrail

FrameTrail is an open source software that let's you experience, manage and edit interactive video directly in your web browser. It enables you to hyperlink filmic contents, include additional multimedia documents (e.g. text overlays, images or interactive maps) and to add supplementing materials (annotations) at specific points.

Language:JavaScriptLicense:NOASSERTIONStargazers:113Issues:18Issues:39

nla2020

Github repository for NLA2020 course

Language:Jupyter NotebookLicense:MITStargazers:82Issues:12Issues:1

ml-deployment-demo

ML Deployment, Two Ways

Language:Jupyter NotebookStargazers:57Issues:3Issues:1

audio_degrader

Audio degradation toolbox in python, with a command-line tool. It is useful to apply controlled degradations to audio: e.g. data augmentation, evaluation in noisy conditions, etc.

Language:PythonLicense:GPL-3.0Stargazers:56Issues:3Issues:26

ffmpeg-commands

Collection of useful FFMPEG commands for processing audio and video files.

Stargazers:43Issues:0Issues:0
Language:PythonLicense:BSD-3-ClauseStargazers:32Issues:7Issues:0

nla2016

Repository for 2016 NLA course @ Skoltech

Language:Jupyter NotebookLicense:MITStargazers:20Issues:8Issues:4