Zhongjie Ye (Vancause)

Vancause

Geek Repo

Company:Peking University

Github PK Tool:Github PK Tool

Zhongjie Ye's repositories

Language:PythonStargazers:1Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

audio-classifier

Classify sounds using YouTube-8M and VGGish models

Language:PythonStargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:1Issues:0

CDur

Repository for the paper "Towards duration robust weakly supervised sound event detection"

Language:PythonLicense:GPL-3.0Stargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:0Issues:0
Language:HTMLStargazers:0Issues:0Issues:0

coala

COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations

License:MITStargazers:0Issues:0Issues:0

crank

A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder

License:MITStargazers:0Issues:0Issues:0

DCASE2021-Task1b

Audio-Visual Classifier in Acoustic Scene Clasification

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

DCASE2021_task6_v2

Code for CVSSP submission to DCASE 2021 Task 6

Stargazers:0Issues:0Issues:0

dcase_2020_T6

2nd place solution for 2020 DCASE challenge task 6 audio captioning. http://dcase.community/challenge2020/task-automatic-audio-captioning-results#wuyusong2020_t6

Stargazers:0Issues:0Issues:0

deepsvg

[NeurIPS 2020] Official code for the paper "DeepSVG: A Hierarchical Generative Network for Vector Graphics Animation". Includes a PyTorch library for deep learning with SVG data.

License:MITStargazers:0Issues:0Issues:0

DeepXi

Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.

License:MPL-2.0Stargazers:0Issues:0Issues:0

dual_encoding

[CVPR2019] Dual Encoding for Zero-Example Video Retrieval

License:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

FullSubNet

PyTorch implementation of "A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

License:MITStargazers:0Issues:0Issues:0

HAKE-Action-Torch

HAKE-Action in PyTorch

License:Apache-2.0Stargazers:0Issues:0Issues:0

Meta-DETR

Meta-DETR: Official PyTorch Implementation

License:MITStargazers:0Issues:0Issues:0

PAGAN

PAGAN: a phase-adapted GAN for speech enhancement

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

ppg-vc

PPG-Based Voice Conversion

License:Apache-2.0Stargazers:0Issues:0Issues:0

PyTorch-VAE

A Collection of Variational Autoencoders (VAE) in PyTorch.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

License:NOASSERTIONStargazers:0Issues:0Issues:0

retrieval-augmentation-nn

Generalization of deep neural networks by using the information of nearest training examples

Stargazers:0Issues:0Issues:0

SCAN

PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0
License:MITStargazers:0Issues:0Issues:0

SpeechSplit

Unsupervised Speech Decomposition Via Triple Information Bottleneck

License:MITStargazers:0Issues:0Issues:0

Swin-Transformer

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

License:MITStargazers:0Issues:0Issues:0

vc_Real-Time-Voice-Cloning

clone Real-Time-Voice-Cloning to test

Stargazers:0Issues:2Issues:0

vcc20_baseline_cyclevae

Voice Conversion Challenge 2020 CycleVAE baseline system

License:MITStargazers:0Issues:0Issues:0