Young Han Lee's repositories

audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

bark

🔊 Text-Prompted Generative Audio Model

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0
Language:HTMLLicense:MITStargazers:0Issues:0Issues:0

captionr-static-web-app

Real-time captioning and translation app on Azure Static Web Apps

Language:VueStargazers:0Issues:1Issues:0

cargan

Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

CLAP

Contrastive Language-Audio Pretraining

Language:PythonLicense:CC0-1.0Stargazers:0Issues:0Issues:0

espnet

End-to-End Speech Processing Toolkit

License:Apache-2.0Stargazers:0Issues:0Issues:0

FACEGOOD-Audio2Face

http://www.facegood.cc

License:MITStargazers:0Issues:0Issues:0

gpu-burn

Multi-GPU CUDA stress test

Language:C++License:BSD-2-ClauseStargazers:0Issues:0Issues:0

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

korean-romanizer

A Python library for Korean romanization

License:NOASSERTIONStargazers:0Issues:0Issues:0

MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

NVAE

The Official PyTorch Implementation of "NVAE: A Deep Hierarchical Variational Autoencoder" (NeurIPS 2020 spotlight paper)

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

photometric_optimization

Photometric optimization code for creating the FLAME texture space and other applications

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

PyTorch-StudioGAN

StudioGAN is a Pytorch library providing implementations of representative Generative Adversarial Networks (GANs) for conditional/unconditional image generation.

License:NOASSERTIONStargazers:0Issues:0Issues:0

Resemblyzer

A python package to analyze and compare voices with deep learning

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

sherpa

Speech-to-text server framework with next-gen Kaldi

License:Apache-2.0Stargazers:0Issues:0Issues:0

SimCLR

PyTorch implementation of SimCLR: A Simple Framework for Contrastive Learning of Visual Representations

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

Speech-Backbones

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

Stargazers:0Issues:0Issues:0

torchgpipe

A GPipe implementation in PyTorch

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:1Issues:0

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

License:NOASSERTIONStargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

vall-e

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

wav2letter

Facebook AI Research's Automatic Speech Recognition Toolkit

Language:C++License:NOASSERTIONStargazers:0Issues:0Issues:0

Wav2Lip

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.

Stargazers:0Issues:0Issues:0

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Language:C++License:Apache-2.0Stargazers:0Issues:1Issues:0

YOLOX_AUDIO

Audio event detection model based on YOLOX

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0