nikvaessen

Nik's repositories

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Language:PythonApache-2.0000

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Language:PythonBSD-2-Clause000

whisper.cpp

Port of OpenAI's Whisper model in C/C++

Language:CMIT000

w2v2-batch-size

Code for paper "The effect of batch size on contrastive self-supervised speech representation learning"

Language:PythonMIT800

nikvaessen

100

tinygrad

You like pytorch? You like micrograd? You love tinygrad! ❤️

Language:PythonMIT000

tinygrad-wav2vec2

A wav2vec 2.0 implementation using TinyGrad

Language:Python000

ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

Language:Jupyter NotebookMIT000

dscore

Diarization scoring tools.

Language:PythonBSD-2-Clause000

LibriMix

An open source dataset for source separation

Language:PythonMIT000

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonMIT100

w2v2-speaker-few-samples

Research code for the paper "Training speaker recognition systems with limited data" at https://arxiv.org/abs/2203.14688

Language:PythonMIT1100

pytorch-lightning

The lightweight PyTorch wrapper for ML researchers. Scale your models. Write less boilerplate

Language:PythonApache-2.0100

jiwer

Python library for calculating Word Error Rate (WER), a common measure of speech recognition performance

Language:PythonApache-2.0100

debug-hydra-lightning

Language:Python100

disjoint-mtl

Research code for "Towards multi-task learning of speech and speaker recognition" at https://arxiv.org/pdf/2302.12773.pdf

Language:PythonMIT700

MLonHPC_May2023

Contains the material for the Machine Learning on HPC systems course on 16-05-2023

Language:Jupyter Notebook100

awk-course

Material for the "Introduction to awk programming" course at Heidelberg University

Language:AwkNOASSERTION100

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

MIT100

triple_accel

Rust edit distance routines accelerated using SIMD. Supports fast Hamming, Levenshtein, restricted Damerau-Levenshtein, etc. distance calculations and string search.

MIT000

2022-repo-mt-w2v2

Language:Python400

PYLLR

Python toolkit for likelihood-ratio calibration of binary classifiers

MIT100

kagi-docs

Documentation for products made by Kagi Inc

100

VocalAdversary2022

100

data

A PyTorch repo for data loading and utilities to be shared by the PyTorch domain libraries.

BSD-3-Clause100

d2l-en

Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 300 universities from 55 countries including Stanford, MIT, Harvard, and Cambridge.

NOASSERTION100

w2v2-speaker

Research code for the paper "Fine-tuning wav2vec2 for speaker recognition" found at https://arxiv.org/abs/2109.15053

Language:PythonMIT14300

transformers

🤗 Transformers: State-of-the-art Natural Language Processing for Pytorch, TensorFlow, and JAX.

Apache-2.0100

dutch-digit-parser

Language:Python100

SBCSAE-preprocess

Preprocessing and downloading scripts for the Santa Barbara Corpus of Spoken American English (SBCSAE).

MIT000