You Zhang's starred repositories
generative-models
Generative Models by Stability AI
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
neuralangelo
Official implementation of "Neuralangelo: High-Fidelity Neural Surface Reconstruction" (CVPR 2023)
awesome-python-scientific-audio
Curated list of python software and packages related to scientific research in audio
Audio-driven-TalkingFace-HeadPose
Code for "Audio-driven Talking Face Video Generation with Learning-based Personalized Head Pose" (Arxiv 2020) and "Predicting Personalized Head Movement From Short Video and Speech Signal" (TMM 2022)
Awesome-Diffusion-Personalization
A collection of resources on personalization with diffusion models.
emotion-classification-from-audio-files
Understanding emotions from audio files using neural networks and multiple datasets.
Point-Bind_Point-LLM
Align 3D Point Cloud with Multi-modalities for Large Language Models
torchsynth
A GPU-optional modular synthesizer in pytorch, 16200x faster than realtime, for audio ML researchers.
Synthetic-Voice-Detection-Vocoder-Artifacts
This repository is related to our Dataset and Detection code from the paper: AI-Synthesized Voice Detection Using Neural Vocoder Artifacts accepted in CVPR Workshop on Media Forensic 2023.
multimodal-decoding
Code associated with the paper titled "A high-performance neuroprosthesis for speech decoding and avatar control" , published in Nature in 2023.
ScalableFHVAE
This repository contains the code to reproduce the core results from the paper "Scalable Factorized Hierarchical Variational Autoencoders"
Audio_Research_in_US
For students who would like to apply for RA, PhD, postdoc in audio research.
Breaking-Security-Critical-Voice-Authentication
Source code for paper "Breaking Security-Critical Voice Authentication".
ntools_elec
Intracranial Electrode Localization
PhaseAntispoofing_INTERSPEECH
Official repository of the paper "Phase perturbation improves channel robustness for speech spoofing countermeasures"