seastar105

HAESUNG JEON (chad.plus)'s repositories

pflow-encodec

Implementation of TTS model based on NVIDIA P-Flow TTS Paper

Language:Python5500

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Language:Jupyter NotebookNOASSERTION000

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language:PythonMIT000

kr-custom-tts-server

Language:Python300

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Apache-2.0000

seastar105.github.io

Language:HTML000

optimum

🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy to use hardware optimization tools

Apache-2.0000

shared_debugging_code

000

kr-custom-tts

Language:Jupyter Notebook1200

so-vits-svc-5.0

Core Engine of Singing Voice Conversion & Singing Voice Clone

MIT000

DCTCRN

Language:Python200

sample_for_project

000

acoustic-model

Acoustic models for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion

Language:PythonMIT000

Moetion

A MediaPipe Solver library, inspired of kalidokit

BSD-3-Clause100

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

MIT000

AITemplate

AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

Language:PythonApache-2.0000