happlydata's repositories
SyncTalk
[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"
SEMamba
This is the official implementation of the SEMamba paper.
midifile
C++ classes for reading/writing Standard MIDI Files
MIDI-BERT
This is the official repository for the paper, MidiBERT-Piano: Large-scale Pre-training for Symbolic Music Understanding.
Awesome-Talking-Face
📖 A curated list of resources dedicated to talking face.
DiffSpeaker
This is the official repository for DiffSpeaker: Speech-Driven 3D Facial Animation with Diffusion Transformer
inferno
🔥🔥🔥 Set the world of 3D faces on fire with INFERNO 🔥🔥🔥
voxangeles
VoxAngeles Corpus
BYOC
[IEEE-VR 2024] Bring Your Own Character: A Holistic Solution for Automatic Facial Animation Generation of Customized Characters
NKF_train
NKF training
TCN-beat-tracker-pytorch
PyTorch implementation of "Temporal convolutional networks for musical audio beat tracking"
pretty-midi
Utility functions for handling MIDI data in a nice/intuitive way.
ai-audio-startups
Community list of startups working with AI in audio and music technology
awesome-audio-plaza
Daily tracking of awesome audio papers, including music generation, zero-shot tts, asr, audio generation
resemble-enhance
AI powered speech denoising and enhancement
real-time-lyrics-alignment
Codebase for 'A Real-Time Lyrics Alignment System Using Chroma And Phonetic Features For Classical Vocal Performance', ICASSP 2024
pesto
Self-supervised learning for fast pitch estimation
gtcrn
An official implementation of GTCRN, an ultra-lite speech enhancement model.
RUI_SE
The official repo of "A Refining Underlying Information Framework for Speech Enhancement"
deepvqe
An unofficial implementation of DeepVQE proposed by Microsoft Corp.
Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
facialanimation
Source code for: Expressive Speech-driven Facial Animation with controllable emotions
BEAT
BEAT huawei 3D dataset
CoMoSVC
CoMoSVC: One-Step Consistency Model Based Singing Voice Conversion & Singing Voice Clone
pitch-detection
autocorrelation-based O(NlogN) pitch detection