Victor Chen's repositories
MTDVocaLiST
Official repository for the paper Multimodal Transformer Distillation for Audio-Visual Synchronization (ICASSP 2024).
awesome-audio-visual-deepfake
awesome-audio-visual-robustness
NTU_FinTech
awesome-audio-visual
A curated list of papers and datasets across areas of audio-visual processing
Codec-SUPERB
Audio Codec Speech processing Universal PERformance Benchmark
DeepLearning-500-questions
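500 Questions on Deep Learning: a question-and-answer treatment of frequently raised topics in probability, linear algebra, machine learning, deep learning, computer vision, and related areas, written to help the author and any readers who need it. The book has 18 chapters and more than 500,000 characters. Given the author's limited expertise, readers are kindly asked to point out any shortcomings. To be continued. For collaboration, contact scutjy2015@163.com. All rights reserved; infringement will be pursued. Tan, 2018.06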
dynamic-superb
The unofficial repository of Dynamic-SUPERB.
end-to-end-lipreading
PyTorch code for End-to-End Audiovisual Speech Recognition
IR-Programming-HW2
Web Retrieval and Mining 2020 (CSIE 5137) - Programming Homework 2
machine-learning-notes
My machine learning notes, written in LaTeX
mlpack
mlpack: a scalable C++ machine learning library
Mandarin-Wav2Vec2
Pre-trained wav2vec 2.0 for Mandarin
s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
speech-trident
Awesome speech/audio LLMs, representation learning, and codec models
vocalist
Official repository for the paper VocaLiST: An Audio-Visual Synchronisation Model for Lips and Voices