Ge Zhu (朱舸) (gzhu06)

Company: University of Rochester

Location: San Francisco

Home Page: gzhu06.github.io

Ge Zhu (朱舸)'s repositories

Y-vector

Y-vector: Multiscale Waveform Encoder for Speaker Embedding

Language: Python | Stargazers: 24 | Issues: 2 | Issues: 0

Cacophony

Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986

Language: Python | License: MIT | Stargazers: 23 | Issues: 4 | Issues: 2

GenerativeSourceSeparation

Open source code for the paper 'Music Source Separation with Generative Flow'

Language: Jupyter Notebook | License: MIT | Stargazers: 20 | Issues: 1 | Issues: 1

Manifold-Constrained-Gradient-ipynb

Unofficial implementation of the paper 'Improving Diffusion Models for Inverse Problems using Manifold Constraints': https://arxiv.org/abs/2206.00941

Language: Jupyter Notebook | License: MIT | Stargazers: 12 | Issues: 1 | Issues: 2

Filler-semi-CRF

Codebase for "Transcription free filler word detection with Neural semi-CRFs" [ICASSP2023]

Language: Python | License: MIT | Stargazers: 8 | Issues: 3 | Issues: 1

Unconditional-Audio-Generation-Benchmark

Unconditional audio generation benchmark

Language: Python | License: Apache-2.0 | Stargazers: 7 | Issues: 2 | Issues: 0

PodcastFillers_Utils

Utility functions for preprocessing PodcastFillers dataset

Language: Python | License: NOASSERTION | Stargazers: 6 | Issues: 2 | Issues: 0

TDspkr-mismatch-study

Code base for "A study of the robustness of raw waveform based speaker embeddings under mismatched conditions"

Language: Python | License: MIT | Stargazers: 5 | Issues: 1 | Issues: 0

Language: Python | License: Apache-2.0 | Stargazers: 4 | Issues: 1 | Issues: 0

gzhu06.github.io

Personal webpage

Language: HTML | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

openSFX-TFShard

A codebase for open source SFX data TFrecord sharding

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0