Huang-Cheng, Chou's starred repositories
seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
promptbench
A unified evaluation framework for large language models
DNS-Challenge
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
acl-style-files
Official style files for papers submitted to venues of the Association for Computational Linguistics
emotion2vec
[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
MultiBench
[NeurIPS 2021] Multiscale Benchmarks for Multimodal Representation Learning
BitcoinArmory
Python-Based Bitcoin Software
SpeechTokenizer
This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on
w2v2-how-to
How to use our public wav2vec2 dimensional emotion model
pfl-research
Simulation framework for accelerating research in Private Federated Learning
AdamW-and-SGDW
Decoupled Weight Decay Regularization (ICLR 2019)
Codec-SUPERB
Audio Codec Speech processing Universal PERformance Benchmark
dynamic-superb
The official repository of Dynamic-SUPERB.
calibration_library
Pytorch library for model calibration metrics and visualizations as well as recalibration methods. In progress!
imbalanced-DL
A Python Package for Deep Imbalanced Learning
bdl-rul-svgd
Bayesian deep learning for remaining useful life estimation via Stein variational gradient descent
Anxiety-Detection-from-free-form-audio-journals
Repository for CS224S project: Detecting anxiety from short clips of free-form speech