happlydata's repositories

SyncTalk

[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"

License:NOASSERTIONStargazers:0Issues:0Issues:0

SEMamba

This is the official implementation of the SEMamba paper.

Stargazers:0Issues:0Issues:0
License:NOASSERTIONStargazers:0Issues:0Issues:0

midifile

C++ classes for reading/writing Standard MIDI Files

License:BSD-2-ClauseStargazers:0Issues:0Issues:0

MIDI-BERT

This is the official repository for the paper, MidiBERT-Piano: Large-scale Pre-training for Symbolic Music Understanding.

License:MITStargazers:0Issues:0Issues:0

Awesome-Talking-Face

📖 A curated list of resources dedicated to talking face.

License:MITStargazers:0Issues:0Issues:0

DiffSpeaker

This is the official repository for DiffSpeaker: Speech-Driven 3D Facial Animation with Diffusion Transformer

Stargazers:0Issues:0Issues:0

inferno

🔥🔥🔥 Set the world of 3D faces on fire with INFERNO 🔥🔥🔥

License:NOASSERTIONStargazers:0Issues:0Issues:0

voxangeles

VoxAngeles Corpus

Stargazers:0Issues:0Issues:0

BYOC

[IEEE-VR 2024] Bring Your Own Character: A Holistic Solution for Automatic Facial Animation Generation of Customized Characters

Stargazers:0Issues:0Issues:0

NKF_train

NKF training

Stargazers:0Issues:0Issues:0

TCN-beat-tracker-pytorch

PyTorch implementation of "Temporal convolutional networks for musical audio beat tracking"

Stargazers:0Issues:0Issues:0

pretty-midi

Utility functions for handling MIDI data in a nice/intuitive way.

License:MITStargazers:0Issues:0Issues:0

ai-audio-startups

Community list of startups working with AI in audio and music technology

License:Apache-2.0Stargazers:0Issues:0Issues:0

awesome-audio-plaza

Daily tracking of awesome audio papers, including music generation, zero-shot tts, asr, audio generation

License:MITStargazers:0Issues:0Issues:0
License:NOASSERTIONStargazers:0Issues:0Issues:0

resemble-enhance

AI powered speech denoising and enhancement

License:MITStargazers:0Issues:0Issues:0

real-time-lyrics-alignment

Codebase for 'A Real-Time Lyrics Alignment System Using Chroma And Phonetic Features For Classical Vocal Performance', ICASSP 2024

License:NOASSERTIONStargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

pesto

Self-supervised learning for fast pitch estimation

License:LGPL-3.0Stargazers:0Issues:0Issues:0

gtcrn

An official implementation of GTCRN, an ultra-lite speech enhancement model.

Stargazers:0Issues:0Issues:0

RUI_SE

The official repo of "A Refining Underlying Information Framework for Speech Enhancement"

Stargazers:0Issues:0Issues:0

deepvqe

An unofficial implementation of DeepVQE proposed by Microsoft Corp.

Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

License:MITStargazers:0Issues:0Issues:0

facialanimation

Source code for: Expressive Speech-driven Facial Animation with controllable emotions

License:Apache-2.0Stargazers:0Issues:0Issues:0

BEAT

BEAT huawei 3D dataset

Stargazers:0Issues:0Issues:0

CoMoSVC

CoMoSVC: One-Step Consistency Model Based Singing Voice Conversion & Singing Voice Clone

Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

pitch-detection

autocorrelation-based O(NlogN) pitch detection

License:MITStargazers:0Issues:0Issues:0