Beast code in Giters

zhongshijun's repositories

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language:PythonMIT000

AudioSep

Official implementation of "Separate Anything You Describe"

000

auto-assess-rhythm-imitation

Code for automatic assessment of rhythmic pattern imitations

GPL-3.0000

CodeTalker

[CVPR 2023] CodeTalker: Speech-Driven 3D Facial Animation with Discrete Motion Prior

MIT000

CoMoSVC

CoMoSVC: One-Step Consistency Model Based Singing Voice Conversion & Singing Voice Clone

000

ComputeLibrary

The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies.

MIT000

conformer

PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)

Apache-2.0000

crepe

CREPE: A Convolutional REpresentation for Pitch Estimation -- pre-trained model (ICASSP 2018)

MIT000

CRUSE

TOWARDS EFFICIENT MODELS FOR REAL-TIME DEEP NOISE SUPPRESSION

000

DALL-E

PyTorch package for the discrete VAE used for DALL·E.

NOASSERTION000

deepvqe

An unofficial implementation of DeepVQE proposed by Microsoft Corp.

000

DiffPitcher

Diffusion-based singing voice pitch correction

000

Diffusion-Models-Papers-Survey-Taxonomy

Diffusion model papers, survey, and taxonomy

000

e2e_dnn_ad_control_for_lin_aec

End-To-End Deep Learning-based Adaptation Control for Linear Acoustic Echo Cancellation

NOASSERTION000

easyeffects

Limiter, compressor, convolver, equalizer and auto volume and many other plugins for PipeWire applications

GPL-3.0000

EAT_code

Official code for ICCV 2023 paper: "Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation".

000

gtcrn

An official implementation of GTCRN, an ultra-lite speech enhancement model.

000

hello-world

Is my first repository.

000

HRTF_field_norm

BSD-3-Clause000

ml-spatial-librispeech

A large synthetic dataset of spatial audio with multiple labels

NOASSERTION000

motion-diffusion-model

The official PyTorch implementation of the paper "Human Motion Diffusion Model"

MIT000

Motion-X

Official implementation of the paper "Motion-X: A Large-scale 3D Expressive Whole-body Human Motion Dataset"

NOASSERTION000

NeuCoSVC

000

NeuralSVB

Learning the Beauty in Songs: Neural Singing Voice Beautifier; ACL 2022 (Main conference); Official code

GPL-3.0000

RUI_SE

The official repo of "A Refining Underlying Information Framework for Speech Enhancement"

000

sgmse

Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation

MIT000

so-vits-svc

SoftVC VITS Singing Voice Conversion

AGPL-3.0000

video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

MIT000

Wav2Lip

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.

000

webrtcperf

WebRTC performance and quality evaluation tool.

AGPL-3.0000