Young Han Lee's repositories
audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in PyTorch
bark
🔊 Text-Prompted Generative Audio Model
captionr-static-web-app
Real-time captioning and translation app on Azure Static Web Apps
cargan
Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"
CLAP
Contrastive Language-Audio Pretraining
espnet
End-to-End Speech Processing Toolkit
FACEGOOD-Audio2Face
http://www.facegood.cc
gpu-burn
Multi-GPU CUDA stress test
korean-romanizer
A Python library for Korean romanization
MiniGPT-4
Open-source code for MiniGPT-4 and MiniGPT-v2
photometric_optimization
Photometric optimization code for creating the FLAME texture space and other applications
PyTorch-StudioGAN
StudioGAN is a PyTorch library providing implementations of representative Generative Adversarial Networks (GANs) for conditional/unconditional image generation.
Resemblyzer
A Python package to analyze and compare voices with deep learning
sherpa
Speech-to-text server framework with next-gen Kaldi
SimCLR
PyTorch implementation of SimCLR: A Simple Framework for Contrastive Learning of Visual Representations
Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
torchgpipe
A GPipe implementation in PyTorch
tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
vall-e
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech). Reproduced demo: https://lifeiteng.github.io/valle/index.html
vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
wav2letter
Facebook AI Research's Automatic Speech Recognition Toolkit
Wav2Lip
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.
YOLOX_AUDIO
Audio event detection model based on YOLOX