hayeong0

followers

following

stars

Korea University

Seoul, Republic of Korea

https://scholar.google.com/citations?user=Jw3X6UgAAAAJ&hl=ko

Organizations

brave-people

Global-Handong-Oriented-Security-Team

Hayeong's starred repositories

ChatTTS

A generative speech model for daily dialogue.

Language:PythonAGPL-3.029681 172 480

llama3

The official Meta Llama 3 GitHub site

Language:PythonNOASSERTION25745 211 229

pykan

Kolmogorov Arnold Networks

Language:Jupyter NotebookMIT14295 110 341

hallo

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Language:PythonMIT8521 588 129

VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Language:Jupyter NotebookNOASSERTION7390 88 122

EMO

Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

StoryDiffusion

Create Magic Story!

Language:Jupyter NotebookApache-2.05706 86 136

MeloTTS

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.

Language:PythonMIT4291 39 152

parler-tts

Inference and training library for high-quality TTS models.

Language:PythonApache-2.03956 54 81

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language:PythonMIT3948 115 77

awesome-kan

A comprehensive collection of KAN(Kolmogorov-Arnold Network)-related resources, including libraries, projects, tutorials, papers, and more, for researchers and developers in the Kolmogorov-Arnold Network field.

AniTalker

[ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding"

Language:Jupyter NotebookApache-2.01327 64 32

LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Language:PythonMIT1159 20 48

MimicBrush

Official implementations for paper: Zero-shot Image Editing with Reference Imitation

Language:PythonApache-2.01040 14 16

flash-diffusion

Official implementation of ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation

Language:PythonNOASSERTION422 10 14

ect

Consistency Models Made Easy

Language:Python187 6 11

FAcodec

Training code for FAcodec presented in NaturalSpeech3

Language:Python142 9 17

VoiceLDM

VoiceLDM: Text-to-Speech with Environmental Context

Language:PythonApache-2.0141 7 4

naturalspeech3_facodec

FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3

Language:Python138 6 5

AudioEditingCode

Language:PythonCC-BY-SA-4.0129 4 5

TextrolSpeech

TextrolSpeech: A Text Style Control Speech Corpus With Codec Language Text-to-Speech Models (2024 ICASSP)

Language:PythonMIT116 6 1

SEMamba

This is the official implementation of the SEMamba paper.

Language:Python107 12 10

friendly-stable-audio-tools

Refactored / updated version of `stable-audio-tools` which is an open-source code for audio/music generative models originally by Stability AI.

Language:PythonMIT106 3 2

LLM-Codec

The open source code for LLM-Codec

Language:Python104 13 4

OpenDMD

Open source implementation and models of One-step Diffusion with Distribution Matching Distillation

Language:PythonGPL-2.098 5 7

language-quantized-autoencoders

Language Quantized AutoEncoders

Language:Python92 1 3

encodecmae

Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'

Language:Python80 4 2

tinyvc

a lightweight voice conversion

Language:PythonApache-2.076 8 3

soundctm

Pytorch implementation of SoundCTM

Language:PythonMIT69 3 1

FreeV

[InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter

Language:PythonMIT67 4 4