twisted's repositories
AliMeeting
The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to provide participants with baseline systems for speech recognition and speaker diarization in conference scenario.
aps
A personal toolkit for single/multi-channel speech recognition & enhancement & separation.
asr_project_template
Template for ASR project
ASV-Anti-Spoofing-DADA
Dual-Adversarial Domain Adaptation for replay spoofing detection in automatic speaker verification.
Awesome-Transformer-Attention
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
dino
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
Duality-Temporal-Channel-Frequency-Attention-Enhanced-Speaker-Representation-Learning
Unofficial implementation of https://arxiv.org/abs/2110.06565 (for speaker verification)
ECAPATDNN
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)
LAGConv
lagconv
Loss-Gated-Learning
ICASSP 2022: 'Self-supervised Speaker Recognition with Loss-gated Learning'
MST-GCN
This is the official implemntation for "Multi-scale spatial temporal graph convolutional network for skeleton-based action recognition" AAAI-2021
New-Grad-Positions-2022
A collection of New Grad full time roles in SWE, Quant, and PM.
RawBoost-antispoofing
This repository includes the code to reproduce our paper "RawBoost: A Raw Data Boosting and Augmentation Method applied to Automatic Speaker Verification Anti-Spoofing".
solo-learn
solo-learn: a library of self-supervised methods for visual representation learning powered by Pytorch Lightning
sr_labs_book
The project is related to the development of labs for the ITMO Speaker Recognition Course.
ssl-for-slr
Collection of self-supervised models for speaker and language recognition tasks.
SSL_Anti-spoofing
This repository includes the code to reproduce our paper "Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation".
StreamingSpeakerDiarization
Demo for the paper "Overlap-aware low-latency online speaker diarization based on end-to-end local segmentation"
TorchSSL
A PyTorch-based library for semi-supervised learning (NeurIPS'21)
TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
TVConv
[CVPR 2022] TVConv: Efficient Translation Variant Convolution for Layout-aware Visual Processing
TWIST
Official codes: Self-Supervised Learning by Estimating Twin Class Distribution
UniSpeech
UniSpeech - Large Scale Self-Supervised Learning for Speech
VisionXformer
Vision Xformers
WAEN
Wavelet Attention Embedding Networks for Video Super-Resolution (ICPR 2020) - Official Repository
WaveletAttention
Wavelet-Attention CNN for Image Classification
WaveMix
2D discrete Wavelet Transform for Image Classification