Young Han Lee's starred repositories
PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
korean-romanizer
A Python library for Korean romanization
PyConKR2023-ModelServing-BentoML
Pycon KR 2023 presentation
open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
SVCC23_FastSVC
Singing Voice Conversion Challenge 2023 Starter Kit: FastSVC Reimplementation
photometric_optimization
Photometric optimization code for creating the FLAME texture space and other applications
DualCycleGAN
Official implementation of DualCycleGAN for nonparallel audio super resolution
MB-iSTFT-VITS
Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform
Awesome-Gaze-Estimation
Awesome Curated List of Eye Gaze Estimation Paper
FACEGOOD-Audio2Face
http://www.facegood.cc
Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
YOLOX_AUDIO
Audio event detection model based on YOLOX
torchgpipe
A GPipe implementation in PyTorch
code-server
VS Code in the browser