Tao Liu (liutaocode)

Company: @X-LANCE

Tao Liu's repositories

TTS-arxiv-daily

Automatically Update Text-to-Speech (TTS) Papers Daily Using GitHub Actions (Updates Every 12 Hours)

Language: Python | License: Apache-2.0 | Stargazers: 53 | Issues: 9 | Issues: 0

talking_face_preprocessing

Preprocessing Scripts for Talking Face Generation

Language: Python | Stargazers: 39 | Issues: 4 | Issues: 0

talking-face-arxiv-daily

🎓 Update Talking-Face Research Papers Daily, Now Integrated with LLM Analysis.

Language: Python | License: Apache-2.0 | Stargazers: 28 | Issues: 5 | Issues: 0

AwesomeDiarizationDataset

Both audio-only and audio-visual speaker diarization datasets are listed here.

DiarizationMetricInOne

Diarization Metric in One: currently supports DER, JER, CDER, SER, and BER

Language: Python | Stargazers: 5 | Issues: 1 | Issues: 0

DiarizationVisualization

Visualization tools for audio-only and multi-modal speaker diarization datasets

Language: HTML | Stargazers: 4 | Issues: 1 | Issues: 0

AwesomeTokenizer

MultiModal Tokenizer Resources

BER

Balanced Error Rate for Speaker Diarization

Language: Python | Stargazers: 1 | Issues: 0 | Issues: 0

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0

dscore-ovl

Detailed information for the diarization metric dscore, including errors in overlapped parts.

Language: Python | License: BSD-2-Clause | Stargazers: 1 | Issues: 0 | Issues: 0

EEND_PyTorch

A PyTorch implementation of End-to-End Neural Diarization

Language: Python | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0

Multi-modal-Speech-Dataset

Multi-modal Speech Dataset