Tatsu Matsushima's repositories

self-supervised-dutch-dysarthria-asr

This is a repository presenting the outcome of my thesis for MSc. Voice Technology at the University of Groningen. The research developed the Dutch dysarthric speech recognition with self-supervised learning (SSL) models, wav2vec 2.0 and XLSR-53. The repo contains the fine-tuned models and the evaluation dataset.

Language:PythonLicense:MITStargazers:1Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Megatron-DeepSpeed-Mistral

Ongoing research training transformer language models at scale, including: BERT & GPT-2

License:NOASSERTIONStargazers:0Issues:0Issues:0

nobelium

A static blog build on top of Notion and NextJS, deployed on Vercel.

Language:JavaScriptLicense:MITStargazers:0Issues:0Issues:0

FastSpeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

License:MITStargazers:0Issues:0Issues:0

DeepFilterNet

Noise supression using deep filtering

License:NOASSERTIONStargazers:0Issues:0Issues:0

mnist-fashion

This repo contains implementations of logistic regression, multi-layer perceptron, and convolutional neural networks for MNIST fashion dataset.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

audio_feature_extraction

This repository contains implementations of audio feature extractions inlcluding spectrogram, mel-scale spectrogram, mel-frequency cepstrum coefficients (MFCC) with numpy.

Language:PythonStargazers:0Issues:0Issues:0

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

License:Apache-2.0Stargazers:0Issues:0Issues:0

Tacotron-2

DeepMind's Tacotron-2 Tensorflow implementation

License:MITStargazers:0Issues:0Issues:0