You Zhang (yzyouzhang)

Company: University of Rochester

Location: NY, US

Home Page: https://yzyouzhang.com

Twitter: @yzyouzhang

Organizations
AirLabUR

You Zhang's starred repositories

ControlNet

Let us control diffusion models!

Language: Python | License: Apache-2.0 | Stargazers: 28805 | Issues: 215 | Issues: 525

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language: Python | License: Apache-2.0 | Stargazers: 23762 | Issues: 195 | Issues: 3733

hydra

Hydra is a framework for elegantly configuring complex applications

Language: Python | License: MIT | Stargazers: 8358 | Issues: 125 | Issues: 1364

demucs

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Language: Python | License: MIT | Stargazers: 7865 | Issues: 151 | Issues: 528

latexify_py

A library to generate LaTeX expressions from Python code.

Language: Python | License: Apache-2.0 | Stargazers: 7099 | Issues: 55 | Issues: 82

axel

Lightweight CLI download accelerator

Language: C | License: GPL-2.0 | Stargazers: 2872 | Issues: 57 | Issues: 273

audio-ai-timeline

A timeline of the latest AI models for audio generation, starting in 2023!

sms-tools

Sound analysis/synthesis tools for music applications

Language: Python | License: AGPL-3.0 | Stargazers: 1604 | Issues: 157 | Issues: 102

responsible-ai-toolbox

Responsible AI Toolbox is a suite of tools providing model and data exploration and assessment user interfaces and libraries that enable a better understanding of AI systems. These interfaces and libraries empower developers and stakeholders of AI systems to develop and monitor AI more responsibly, and take better data-driven actions.

Language: TypeScript | License: MIT | Stargazers: 1271 | Issues: 29 | Issues: 276

Spatial_Audio_Framework

A cross-platform framework for developing spatial audio algorithms and software in C/C++

Language: C | License: NOASSERTION | Stargazers: 534 | Issues: 28 | Issues: 21

wav2lip-hq

Extension of Wav2Lip repository for processing high-quality videos.

Speech-Resources

Speech-related labs, companies, resources, internships, and more; recommendations and self-nominations are welcome.

pt-dec

PyTorch implementation of DEC (Deep Embedding Clustering)

Language: Python | License: MIT | Stargazers: 282 | Issues: 6 | Issues: 12
Language: Python | License: MIT | Stargazers: 226 | Issues: 8 | Issues: 12

if-sad-send-cat

🐱 A program that sends cats to my phone when I'm sad at the computer.

s2v_rc

Speech2Vec Reality Check

sfs-python

SFS Toolbox for Python

Language: Python | License: MIT | Stargazers: 64 | Issues: 14 | Issues: 45

simple-asgan

Training code and trained checkpoints for ASGAN.

ASDNet

Audio-Visual Active Speaker Detection with PyTorch on AVA-ActiveSpeaker dataset

couta

A time series anomaly detection method based on a calibrated one-class classifier

Language: Python | License: Apache-2.0 | Stargazers: 49 | Issues: 2 | Issues: 5

SoundSynth

Code for sound synthesis

Signal-Generator

The signal generator is a mex-function for MATLAB that can be used to generate the response of a moving sound source and receiver in a reverberant environment.

Language: C++ | License: GPL-3.0 | Stargazers: 33 | Issues: 2 | Issues: 1

SpectroMap

SpectroMap is a peak detection algorithm that computes the constellation map for a given signal

Language: Python | License: GPL-3.0 | Stargazers: 29 | Issues: 2 | Issues: 2

samo

SAMO: Speaker Attractor Multi-Center One-Class Learning for Voice Anti-Spoofing

Language: Python | License: MIT | Stargazers: 29 | Issues: 1 | Issues: 2

rethinking-visual-sound-localization

Official implementation of the paper How to Listen? Rethinking Visual Sound Localization

Language: Python | License: MIT | Stargazers: 16 | Issues: 2 | Issues: 1

Comparative-Analysis-Voice-Spoofing

A comparative analysis of voice spoofing detection systems, based on a paper available at https://arxiv.org/abs/2210.00417.

Language: MATLAB | Stargazers: 11 | Issues: 1 | Issues: 0

binaural-auditory-model-RAA

Binaural auditory model using the AMT Toolbox framework, as used in the paper "Predicting perceived reverberation in different room acoustic environments using a binaural auditory model"

Language: MATLAB | Stargazers: 4 | Issues: 2 | Issues: 0