Pan Zexu (zexupan)

Company: National University of Singapore

Location: Singapore

Pan Zexu's starred repositories

cocktail-fork-separation

Baseline multi-resolution cross network model trained using the Divide and Remaster Dataset

Language: Python, License: MIT, Stargazers: 70, Issues: 0

FlatTrajectoryDistillation_FTD

The code of the paper "Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation" (CVPR2023)

Stargazers: 17, Issues: 0

EE4208ComputerVision

Face Detection

Language: Python, Stargazers: 2, Issues: 0
Language: Python, Stargazers: 4, Issues: 0

Waveformer

A deep neural network architecture for low-latency audio processing

Language: Python, License: MIT, Stargazers: 273, Issues: 0

TalkNet-ASD

ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'

Language: Python, License: MIT, Stargazers: 269, Issues: 0
Language: Python, Stargazers: 11, Issues: 0

espnet

End-to-End Speech Processing Toolkit

Language: Python, License: Apache-2.0, Stargazers: 8020, Issues: 0

dscore

Diarization scoring tools.

Language: Python, License: BSD-2-Clause, Stargazers: 198, Issues: 0

ast

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

Language: Jupyter Notebook, License: BSD-3-Clause, Stargazers: 1043, Issues: 0

PaSST

Efficient Training of Audio Transformers with Patchout

Language: Python, License: Apache-2.0, Stargazers: 283, Issues: 0
Language: Python, Stargazers: 42, Issues: 0

nara_wpe

Different implementations of "Weighted Prediction Error" for speech dereverberation

Language: Python, License: MIT, Stargazers: 463, Issues: 0
Language: Python, Stargazers: 14, Issues: 0
Language: Python, Stargazers: 3, Issues: 0

Awesome-CLIP

Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).

Stargazers: 1053, Issues: 0
Language: Python, Stargazers: 14, Issues: 0
Language: Python, Stargazers: 29, Issues: 0

Conference-Acceptance-Rate

Acceptance rates for the major AI conferences

Language: Jupyter Notebook, License: MIT, Stargazers: 3931, Issues: 0

speaker_extraction

Target speaker extraction and verification for multi-talker speech

Language: Python, License: GPL-3.0, Stargazers: 146, Issues: 0

youtube-gesture-dataset

This repository contains scripts to build Youtube Gesture Dataset.

Language: Python, License: BSD-3-Clause, Stargazers: 108, Issues: 0

ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

Language: Jupyter Notebook, License: MIT, Stargazers: 1504, Issues: 0

TTS

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Language: Jupyter Notebook, License: MPL-2.0, Stargazers: 8950, Issues: 0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Python, License: MPL-2.0, Stargazers: 31164, Issues: 0

awesome-audio-visual

A curated list of different papers and datasets in various areas of audio-visual processing

Stargazers: 624, Issues: 0

FastSpeech

An implementation of FastSpeech based on PyTorch.

Language: Python, License: MIT, Stargazers: 846, Issues: 0

FastSpeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

Language: Python, License: MIT, Stargazers: 1656, Issues: 0

Contrastive-Predictive-Coding-PyTorch

Contrastive Predictive Coding for Automatic Speaker Verification

Language: Python, License: MIT, Stargazers: 470, Issues: 0

pystoi

Python implementation of the Short Term Objective Intelligibility measure

Language: MATLAB, License: MIT, Stargazers: 311, Issues: 0