Hui Wang's starred repositories
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
LLaMA-Factory
A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Awesome-Learning-with-Label-Noise
A curated list of resources for Learning with Noisy Labels
DeepFilterNet
Noise supression using deep filtering
pytorch-lightning-template
An easy/swift-to-adapt PyTorch-Lighting template. 套壳模板,简单易用,稍改原来Pytorch代码,即可适配Lightning。You can translate your previous Pytorch code much easier using this template, and keep your freedom to edit all the functions as well. Big-project-friendly as well. No need to rewrite your config in hydra.
onnxruntime-inference-examples
Examples for using ONNX Runtime for machine learning inferencing.
voxceleb_trainer
In defence of metric learning for speaker recognition
torch-audiomentations
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
speech-trident
Awesome speech/audio LLMs, representation learning, and codec models
Conv-TasNet
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation Pytorch's Implement
Dual-Path-RNN-Pytorch
Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation implemented by Pytorch
ava-dataset
The AVA dataset densely annotates 80 atomic visual actions in 351k movie clips with actions localized in space and time, resulting in 1.65M action labels with multiple labels per human occurring frequently.
Awesome-Speaker-Diarization
Some comprehensive papers about speaker diarization
voxconverse
Spot the conversation: speaker diarisation in the wild
Auto-Tuning-Spectral-Clustering
This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"
MossFormer2
This is the audio sample repository for speech separation model "MossFormer2".
speaker_extraction_SpEx
multi-scale time domain speaker extraction
ScriptsForVoxBlink
A repo containing download guidance and corresponding scripts of the VoxBlink dataset.