Beast code in Giters

Josef's starred repositories

bert

TensorFlow code and pre-trained models for BERT

Language:PythonApache-2.037687 998 1142

google-research

Google Research

Language:Jupyter NotebookApache-2.033653 748 1227

NLP-progress

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

Language:PythonMIT22483 1266 100

kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Language:ShellNOASSERTION14029 696 1641

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language:PythonApache-2.011277 201 2196

sonnet

TensorFlow-based neural network library

Language:PythonApache-2.09605 423 187

espnet

End-to-End Speech Processing Toolkit

Language:PythonApache-2.08218 180 2341

Data-Analysis

Data Science Using Python

Language:Jupyter NotebookMIT5154 354 61

uis-rnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

Language:PythonApache-2.01547 102 87

ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

Language:Jupyter NotebookMIT1531 45 254

UnsupervisedMT

Phrase-Based & Neural Unsupervised Machine Translation

Language:PythonNOASSERTION1507 122 101

deepops

Tools for building GPU clusters

Language:ShellBSD-3-Clause1239 52 429

loop

A method to generate speech across multiple speakers

Language:PythonNOASSERTION870 68 75

CTCDecoder

Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon search, prefix search, and token passing. Implemented in Python.

Language:PythonMIT813 25 23

knn-vc

Voice Conversion With Just Nearest Neighbors

Language:PythonNOASSERTION440 14 35

audio-annotator

A JavaScript interface for annotating and labeling audio files.

Language:JavaScriptBSD-2-Clause432 17 10

autoEdit_2

Fast text based video editing, node Electron Os X desktop app, with Backbone front end.

Language:JavaScriptMIT418 39 73

open-speech-recording

Web application to record speech for an open data set

Language:HTMLApache-2.0417 25 7

DistanceGAN

Pytorch implementation of "One-Sided Unsupervised Domain Mapping" NIPS 2017

Language:PythonNOASSERTION195 12 4

vcc20_baseline_cyclevae

Voice Conversion Challenge 2020 CycleVAE baseline system

Language:PythonMIT132 6 9

word-embeddings-for-nmt

Supplementary material for "When and Why Are Pre-trained Word Embeddings Useful for Neural Machine Translation?" at NAACL 2018

Language:Python119 9 5

FrameTrail is an open source software that let's you experience, manage and edit interactive video directly in your web browser. It enables you to hyperlink filmic contents, include additional multimedia documents (e.g. text overlays, images or interactive maps) and to add supplementing materials (annotations) at specific points.

Language:JavaScriptNOASSERTION113 18 39

d2sys