HAESUNG JEON (chad.plus)'s repositories
pflow-encodec
Implementation of TTS model based on NVIDIA P-Flow TTS Paper
seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
transformers
๐ค Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
optimum
๐ Accelerate training and inference of ๐ค Transformers and ๐ค Diffusers with easy to use hardware optimization tools
so-vits-svc-5.0
Core Engine of Singing Voice Conversion & Singing Voice Clone
acoustic-model
Acoustic models for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion
whisper
Robust Speech Recognition via Large-Scale Weak Supervision
AITemplate
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
benchmark
TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.
espnet
End-to-End Speech Processing Toolkit
s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
PS_Recommender
Personalized problem recommender for Baekjoon Online Judge, Capstone Design project of ์ธ๋ผ์ด์ธ๋ผ์ด, Sogang Univ. 2021
GitPycharm
GitPycharm
ForLovePlus
Capture window, crop, OCR, Translate using googletrans, tesseract for Windows
WGANSing
Multi-voice singing voice synthesis
AdaSpeech
AdaSpeech: Adaptive Text to Speech for Custom Voice
Algorithm_Lecture_Note
ICPC Sinchon 2021 Winter Algorithm Camp ๊ณ ๊ธ๋ฐ์์ ์งํํ ๊ฐ์์ ๊ฐ์ ๋ ธํธ๋ค์ ๋๋ค.