You Zhang (yzyouzhang)

yzyouzhang

Geek Repo

Company:University of Rochester

Location:NY, US

Home Page:https://yzyouzhang.com

Twitter:@yzyouzhang

Github PK Tool:Github PK Tool


Organizations
AirLabUR

You Zhang's starred repositories

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonLicense:MPL-2.0Stargazers:32272Issues:273Issues:1068

paper-reading

深度学习经典、新论文逐段精读

License:Apache-2.0Stargazers:25157Issues:706Issues:0

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language:PythonLicense:MITStargazers:19253Issues:297Issues:1340

gdrive

Google Drive CLI Client

demucs

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Language:PythonLicense:MITStargazers:7974Issues:150Issues:532

latexify_py

A library to generate LaTeX expression from Python code.

Language:PythonLicense:Apache-2.0Stargazers:7112Issues:55Issues:82

arxiv-latex-cleaner

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Language:PythonLicense:Apache-2.0Stargazers:5058Issues:31Issues:52

improved-diffusion

Release for Improved Denoising Diffusion Probabilistic Models

Language:PythonLicense:MITStargazers:3053Issues:124Issues:127

audio-diffusion-pytorch

Audio generation using diffusion models, in PyTorch.

Language:PythonLicense:MITStargazers:1885Issues:40Issues:43

Awesome-Implicit-NeRF-Robotics

A comprehensive list of Implicit Representations and NeRF papers relating to Robotics/RL domain, including papers, codes, and related websites

AD-NeRF

This repository contains a PyTorch implementation of "AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis".

Language:PythonLicense:MITStargazers:1009Issues:16Issues:138

speechmetrics

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

Language:PythonLicense:MITStargazers:872Issues:23Issues:32

AudioMAE

This repo hosts the code and models of "Masked Autoencoders that Listen".

Language:PythonLicense:NOASSERTIONStargazers:510Issues:34Issues:27

Speech-Resources

语音方向实验室/公司/资源/实习等,欢迎推荐或自荐

DFRF

[ECCV2022] The implementation for "Learning Dynamic Facial Radiance Fields for Few-Shot Talking Head Synthesis".

Language:PythonLicense:MITStargazers:335Issues:10Issues:37

pt-dec

PyTorch implementation of DEC (Deep Embedding Clustering)

Language:PythonLicense:MITStargazers:288Issues:6Issues:12

if-sad-send-cat

🐱 A program that sends cats to my phone when I'm sad at the computer.

Language:PythonLicense:Apache-2.0Stargazers:167Issues:8Issues:7

BIRD

Big Impulse Response Dataset

Language:PythonLicense:GPL-3.0Stargazers:136Issues:9Issues:2

SeqDeepFake

[ECCV 2022] PyTorch code for SeqDeepFake: Detecting and Recovering Sequential DeepFake Manipulation

Skipping-The-Frame-Level

A simple yet effective Audio-to-Midi Automatic Piano Transcription system

Language:PythonLicense:MITStargazers:77Issues:7Issues:17

s2v_rc

Speech2Vec Reality Check

sfs-python

SFS Toolbox for Python

Language:PythonLicense:MITStargazers:64Issues:14Issues:45

simple-asgan

Training code and trained checkpoints for ASGAN.

itsp

Introduction to Speech Processing

Language:Jupyter NotebookLicense:CC-BY-SA-4.0Stargazers:52Issues:4Issues:4

couta

a time series anomaly detection method based on the calibrated one-class classifier

Language:PythonLicense:Apache-2.0Stargazers:50Issues:2Issues:5

libmpeghe

MPEG-H 3D Audio Low Complexity Profile Encoder. Decoder: https://github.com/ittiam-systems/libmpegh

Language:CLicense:BSD-3-Clause-ClearStargazers:41Issues:4Issues:8

heterogeneous_separation

Code and data recipes for the paper: Heterogeneous Target Speech Separation

Language:PythonLicense:MITStargazers:38Issues:0Issues:0

LookForTheChange

Code for Look for the Change paper published at CVPR 2022

Language:PythonLicense:MITStargazers:35Issues:4Issues:6

Signal-Generator

The signal generator is a mex-function for MATLAB that can be used to generate the response of a moving sound source and receiver in a reverberant environment.

Language:C++License:GPL-3.0Stargazers:33Issues:2Issues:1