Pan Zexu's starred repositories
Conference-Acceptance-Rate
Acceptance rates for the major AI conferences
FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
Awesome-CLIP
Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).
FastSpeech
The Implementation of FastSpeech based on pytorch.
awesome-audio-visual
A curated list of different papers and datasets in various areas of audio-visual processing
Contrastive-Predictive-Coding-PyTorch
Contrastive Predictive Coding for Automatic Speaker Verification
TalkNet-ASD
ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'
Waveformer
A deep neural network architecture for low-latency audio processing
speaker_extraction
target speaker extraction and verification for multi-talker speech
youtube-gesture-dataset
This repository contains scripts to build Youtube Gesture Dataset.
cocktail-fork-separation
Baseline multi-resolution cross network model trained using the Divide and Remaster Dataset
FlatTrajectoryDistillation_FTD
The code of the paper "Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation" (CVPR2023)
EE4208ComputerVision
Face Detection