Ge Zhu (朱舸) (gzhu06)

gzhu06

User data from Github https://github.com/gzhu06

Company:University of Rochester

Location:San Francisco

Home Page:gzhu06.github.io

GitHub:@gzhu06

Ge Zhu (朱舸)'s repositories

Cacophony

Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986

Language:PythonLicense:MITStargazers:48Issues:4Issues:5

GenerativeSourceSeparation

Open source code for the paper 'Music Source Separation with Generative Flow'

Language:Jupyter NotebookLicense:MITStargazers:26Issues:1Issues:1

Y-vector

Y-vector: Multiscale Waveform Encoder for Speaker Embedding

Language:PythonLicense:MITStargazers:23Issues:2Issues:0

Manifold-Constrained-Gradient-ipynb

Unofficial implementation for the paper 'Improving Diffusion Models for Inverse Problems using Manifold Constraints'[https://arxiv.org/abs/2206.00941]

Language:Jupyter NotebookLicense:MITStargazers:12Issues:1Issues:2

Unconditional-Audio-Generation-Benchmark

Unconditional audio generation benchmark

Language:PythonLicense:Apache-2.0Stargazers:10Issues:2Issues:0

PodcastFillers_Utils

Utility functions for preprocessing PodcastFillers dataset

Language:PythonLicense:NOASSERTIONStargazers:9Issues:2Issues:0

Filler-semi-CRF

Codebase for "Transcription free filler word detection with Neural semi-CRFs" [ICASSP2023]

Language:PythonLicense:MITStargazers:8Issues:4Issues:2

TDspkr-mismatch-study

Code base for "A study of the robustness of raw waveform based speaker embeddings under mismatched conditions"

Language:PythonLicense:MITStargazers:5Issues:1Issues:0

AudioDiffuser

Companion codebase for the paper "A Review on Score-based Generative Models for Audio Applications" (https://arxiv.org/abs/2506.08457)

Language:PythonLicense:MITStargazers:4Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:4Issues:1Issues:0

gzhu06.github.io

Personal webpage

Language:HTMLLicense:MITStargazers:0Issues:1Issues:0

openSFX-TFShard

A codebase for open source SFX data TFrecord sharding

Language:PythonLicense:MITStargazers:0Issues:1Issues:0