Hsuan-Fu Wang's repositories
SpeechCLIP_plus
SpeechCLIP+: Self-supervised multi-task representation learning for speech via CLIP and speech-image data. Accepted to ICASSP 2024, Self-supervision in Audio, Speech, and Beyond (SASB) workshop.
2023-Fall-ADL
Homeworks of NTU 2023 Fall Applied deep learning
2023-Fall-DeepMIR
2023 Fall NTU 深度學習於音樂分析及生成
NeuralSVB
Learning the Beauty in Songs: Neural Singing Voice Beautifier; ACL 2022 (Main conference); Official code
oh-my-bash
A delightful community-driven framework for managing your bash configuration, and an auto-update tool so that makes it easy to keep up with the latest updates from the community.
qlora
QLoRA: Efficient Finetuning of Quantized LLMs
Rank-N-Contrast
[NeurIPS 2023, Spotlight] Rank-N-Contrast: Learning Continuous Representations for Regression
s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
SpeechCLIP_test
SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022
word-discovery
Word Discovery in Visually Grounded, Self-Supervised Speech Models