Young Han Lee's repositories
VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
vall-e
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech). Reproduced demo: https://lifeiteng.github.io/valle/index.html
MiniGPT-4
Open-source code for MiniGPT-4 and MiniGPT-v2
sherpa
Speech-to-text server framework with next-gen Kaldi
bark
🔊 Text-Prompted Generative Audio Model
audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in PyTorch
tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
CLAP
Contrastive Language-Audio Pretraining
korean-romanizer
A Python library for Korean romanization
FACEGOOD-Audio2Face
http://www.facegood.cc
gpu-burn
Multi-GPU CUDA stress test
photometric_optimization
Photometric optimization code for creating the FLAME texture space and other applications
Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
YOLOX_AUDIO
Audio event detection model based on YOLOX
vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
cargan
Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"
Wav2Lip
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.
SimCLR
PyTorch implementation of SimCLR: A Simple Framework for Contrastive Learning of Visual Representations
espnet
End-to-End Speech Processing Toolkit
hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
PyTorch-StudioGAN
StudioGAN is a PyTorch library providing implementations of representative Generative Adversarial Networks (GANs) for conditional/unconditional image generation.
captionr-static-web-app
Real-time captioning and translation app on Azure Static Web Apps
wav2letter
Facebook AI Research's Automatic Speech Recognition Toolkit
Resemblyzer
A Python package to analyze and compare voices with deep learning
NVAE
The Official PyTorch Implementation of "NVAE: A Deep Hierarchical Variational Autoencoder" (NeurIPS 2020 spotlight paper)