Hayeong (hayeong0)

Company: Korea University

Location: Seoul, Republic of Korea

Home Page: https://scholar.google.com/citations?user=Jw3X6UgAAAAJ&hl=ko


Organizations
brave-people
Global-Handong-Oriented-Security-Team

Hayeong's starred repositories

ChatTTS

A generative speech model for daily dialogue.

Language: Python | License: AGPL-3.0 | Stargazers: 29681 | Issues: 172 | Issues: 480

llama3

The official Meta Llama 3 GitHub site

Language: Python | License: NOASSERTION | Stargazers: 25745 | Issues: 211 | Issues: 229

pykan

Kolmogorov Arnold Networks

Language: Jupyter Notebook | License: MIT | Stargazers: 14295 | Issues: 110 | Issues: 341

hallo

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Language: Python | License: MIT | Stargazers: 8521 | Issues: 588 | Issues: 129

VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 7390 | Issues: 88 | Issues: 122

EMO

Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

StoryDiffusion

Create Magic Story!

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 5706 | Issues: 86 | Issues: 136

MeloTTS

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.

Language: Python | License: MIT | Stargazers: 4291 | Issues: 39 | Issues: 152

parler-tts

Inference and training library for high-quality TTS models.

Language: Python | License: Apache-2.0 | Stargazers: 3956 | Issues: 54 | Issues: 81

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language: Python | License: MIT | Stargazers: 3948 | Issues: 115 | Issues: 77

awesome-kan

A comprehensive collection of KAN(Kolmogorov-Arnold Network)-related resources, including libraries, projects, tutorials, papers, and more, for researchers and developers in the Kolmogorov-Arnold Network field.

AniTalker

[ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding"

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 1327 | Issues: 64 | Issues: 32

LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Language: Python | License: MIT | Stargazers: 1159 | Issues: 20 | Issues: 48

MimicBrush

Official implementations for paper: Zero-shot Image Editing with Reference Imitation

Language: Python | License: Apache-2.0 | Stargazers: 1040 | Issues: 14 | Issues: 16

flash-diffusion

Official implementation of ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation

Language: Python | License: NOASSERTION | Stargazers: 422 | Issues: 10 | Issues: 14

ect

Consistency Models Made Easy

FAcodec

Training code for FAcodec presented in NaturalSpeech3

VoiceLDM

VoiceLDM: Text-to-Speech with Environmental Context

Language: Python | License: Apache-2.0 | Stargazers: 141 | Issues: 7 | Issues: 4

naturalspeech3_facodec

FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3

Language: Python | License: CC-BY-SA-4.0 | Stargazers: 129 | Issues: 4 | Issues: 5

TextrolSpeech

TextrolSpeech: A Text Style Control Speech Corpus With Codec Language Text-to-Speech Models (2024 ICASSP)

Language: Python | License: MIT | Stargazers: 116 | Issues: 6 | Issues: 1

SEMamba

This is the official implementation of the SEMamba paper.

friendly-stable-audio-tools

Refactored / updated version of `stable-audio-tools` which is an open-source code for audio/music generative models originally by Stability AI.

Language: Python | License: MIT | Stargazers: 106 | Issues: 3 | Issues: 2

LLM-Codec

The open source code for LLM-Codec

OpenDMD

Open source implementation and models of One-step Diffusion with Distribution Matching Distillation

Language: Python | License: GPL-2.0 | Stargazers: 98 | Issues: 5 | Issues: 7

language-quantized-autoencoders

Language Quantized AutoEncoders

encodecmae

Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'

tinyvc

a lightweight voice conversion

Language: Python | License: Apache-2.0 | Stargazers: 76 | Issues: 8 | Issues: 3

soundctm

Pytorch implementation of SoundCTM

Language: Python | License: MIT | Stargazers: 69 | Issues: 3 | Issues: 1

FreeV

[InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter

Language: Python | License: MIT | Stargazers: 67 | Issues: 4 | Issues: 4