You Zhang (yzyouzhang)

Company: University of Rochester

Location: NY, US

Home Page: https://yzyouzhang.com

Twitter: @yzyouzhang

Organizations
AirLabUR

You Zhang's repositories

AIR-ASVspoof

Official implementation of the SPL paper "One-class Learning Towards Synthetic Voice Spoofing Detection"

Language: Python | License: MIT | Stargazers: 92 | Issues: 3 | Issues: 31

ASVspoof2021_AIR

Official implementation of our ASVspoof 2021 paper, "UR Channel-Robust Synthetic Speech Detection System for ASVspoof 2021"

Language: Python | License: MIT | Stargazers: 47 | Issues: 4 | Issues: 13

Audio_Research_in_US

For students who would like to apply for RA, PhD, or postdoc positions in audio research.

Stargazers: 20 | Issues: 0 | Issues: 0

hrtf_field

Official implementation of the ICASSP 2023 paper "HRTF Field: Unifying Measured HRTF Magnitude Representation with Neural Fields"

Language: Python | License: MIT | Stargazers: 20 | Issues: 2 | Issues: 0

SASV_PR

Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"

Language: Python | License: MIT | Stargazers: 13 | Issues: 2 | Issues: 0

Empirical-Channel-CM

Official implementation of our Interspeech 2021 paper "An Empirical Study on Channel Effects for Synthetic Voice Spoofing Countermeasure Systems"

Language: Python | License: MIT | Stargazers: 11 | Issues: 3 | Issues: 3

CS61Bsp18-proj2-byog

Project BYoG for UCB course CS61B Data Structures Spring 2018

Language: Java | Stargazers: 7 | Issues: 0 | Issues: 0

HBAS_chapter_voice3

Official implementation of the handbook chapter "Generalizing Voice Presentation Attack Detection to Unseen Synthetic Attacks and Channel Variation"

Language: Python | License: MIT | Stargazers: 4 | Issues: 0 | Issues: 0

HRTF_field_norm

Official implementation of our WASPAA 2023 paper "Mitigating Cross-Database Differences for Learning Unified HRTF Representation"

Language: Python | License: BSD-3-Clause | Stargazers: 1 | Issues: 0 | Issues: 0

awesome-audio-visual

A curated list of different papers and datasets in various areas of audio-visual processing

Stargazers: 0 | Issues: 0 | Issues: 0

awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

INFO159-LHW4-Chatbot

A PyTorch chatbot for INFO159 Natural Language Processing

Language: Python | Stargazers: 0 | Issues: 2 | Issues: 0

PhaseAntispoofing_INTERSPEECH

Official repository of the Interspeech 2023 paper "Phase perturbation improves channel robustness for speech spoofing countermeasures"

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

DiffGAN-TTS

PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

espnet

End-to-End Speech Processing Toolkit

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

FastDiff

PyTorch Implementation of FastDiff (IJCAI'22)

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

flowtron

Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

mellotron

Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 0 | Issues: 0 | Issues: 0

Online-Recurrent-Extreme-Learning-Machine

Online-Recurrent-Extreme-Learning-Machine (OR-ELM) for time-series prediction, implemented in Python

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

samo

SAMO: Speaker Attractor Multi-Center One-Class Learning for Voice Anti-Spoofing

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

serve

Serve, optimize and scale PyTorch models in production

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

SingFake

Official Repository for "SingFake: Singing Voice Deepfake Detection"

Language: JavaScript | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

SpeechTasks

This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent speech tool development, and speech applications.

Stargazers: 0 | Issues: 0 | Issues: 0