88aggressive

liu shuanghong's repositories

Loss-Gated-Learning

ICASSP 2022: 'Self-supervised Speaker Recognition with Loss-gated Learning'

MIT

TS-TalkNet

INTERSPEECH 2023: Target Active Speaker Detection with Audio-visual Cues

Agglomerative-Hierarchical-Clustering-from-scratch

Build an agglomerative hierarchical clustering algorithm from scratch, i.e. without any advanced libraries such as NumPy, pandas, or scikit-learn.

openTSNE

Extensible, parallel implementations of t-SNE

BSD-3-Clause

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Apache-2.0

Swin-Transformer

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

MIT

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

MIT

SV_eval_protocols_for_SD

Speaker verification evaluation protocols simulating speaker diarisation

MIT

D-TDNN

PyTorch implementation of Densely Connected Time Delay Neural Network

dover-lap

Python package for combining diarization system outputs.

MIT

KeepChatGPT

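Makes using ChatGPT more efficient and smoother: it fully resolves ChatGPT network errors, so you no longer have to refresh the page constantly, saving a full 10 redundant steps. It can also disable the background moderation audit. It resolves these errors: (1) NetworkError when attempting to fetch resource. (2) Something went wrong. If this issue persists please contact us through our help center at help.openai.com. (3) This content may violate our content policy. (4) Conversation not found.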

GPL-2.0

Conv-TasNet

A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).

MIT