You Zhang's starred repositories
paper-reading
深度学习经典、新论文逐段精读
Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models
arxiv-latex-cleaner
arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv
improved-diffusion
Release for Improved Denoising Diffusion Probabilistic Models
audio-diffusion-pytorch
Audio generation using diffusion models, in PyTorch.
ai-audio-startups
Community list of startups working with AI in audio and music technology
Awesome-Implicit-NeRF-Robotics
A comprehensive list of Implicit Representations and NeRF papers relating to Robotics/RL domain, including papers, codes, and related websites
speechmetrics
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
Speech-Resources
语音方向实验室/公司/资源/实习等,欢迎推荐或自荐
sound-spaces
A first-of-its-kind acoustic simulation platform for audio-visual embodied AI research. It supports training and evaluating multiple tasks and applications.
Awesome-Speech-Pretraining
Paper, Code and Statistics for Self-Supervised Learning and Pre-Training on Speech.
SeqDeepFake
[ECCV 2022] PyTorch code for SeqDeepFake: Detecting and Recovering Sequential DeepFake Manipulation
CVPR-2021-Paper-Statistics
Statistics and Visualization of acceptance rate, main keyword of CVPR 2021 accepted papers for the main Computer Vision conference (CVPR)
Skipping-The-Frame-Level
A simple yet effective Audio-to-Midi Automatic Piano Transcription system
heterogeneous_separation
Code and data recipes for the paper: Heterogeneous Target Speech Separation
LookForTheChange
Code for Look for the Change paper published at CVPR 2022