Tao Liu's repositories
TTS-arxiv-daily
Automatically Update Text-to-Speech (TTS) Papers Daily Using GitHub Actions (Updates Every 12 Hours)
talking_face_preprocessing
Preprocessing Scripts for Talking Face Generation
talking-face-arxiv-daily
🎓 Update Talking-Face Research Papers Daily, Now Integrated with LLM Analysis.
AwesomeDiarizationDataset
Both audio-only and audio-visual speaker diarization datasets are listed here.
DiarizationMetricInOne
Diarization Metric in One: currently supports DER, JER, CDER, SER, and BER
DiarizationVisualization
Visualization tools for audio-only and multi-modal speaker diarization datasets
AwesomeTokenizer
MultiModal Tokenizer Resources
dscore-ovl
Detailed information for the diarization metric tool dscore, including errors in overlapped parts.
EEND_PyTorch
A PyTorch implementation of End-to-End Neural Diarization
Multi-modal-Speech-Dataset
Multi-modal Speech Dataset