twisted's repositories

AliMeeting

This project is associated with the ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT). It provides participants with baseline systems for speech recognition and speaker diarization in conference scenarios.

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

aps

A personal toolkit for single/multi-channel speech recognition & enhancement & separation.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

asr_project_template

Template for ASR project

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

ASV-Anti-Spoofing-DADA

Dual-Adversarial Domain Adaptation for replay spoofing detection in automatic speaker verification.

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

Awesome-Transformer-Attention

A comprehensive paper list on Vision Transformer/Attention, including papers, code, and related websites

Stargazers: 0 | Issues: 0 | Issues: 0

byol-a

BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 1 | Issues: 0

dino

PyTorch code for training Vision Transformers with the self-supervised learning method DINO

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Duality-Temporal-Channel-Frequency-Attention-Enhanced-Speaker-Representation-Learning

Unofficial implementation of https://arxiv.org/abs/2110.06565 (for speaker verification)

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

ECAPATDNN

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 on Vox1_O when trained only on Vox2)

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

EPSANet

EPSANet

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

hyperion

Python toolkit for speech processing

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

LAGConv

lagconv

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

Loss-Gated-Learning

ICASSP 2022: 'Self-supervised Speaker Recognition with Loss-gated Learning'

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

MST-GCN

This is the official implementation of "Multi-scale spatial temporal graph convolutional network for skeleton-based action recognition" (AAAI 2021)

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

New-Grad-Positions-2022

A collection of new-grad full-time roles in SWE, Quant, and PM.

Stargazers: 0 | Issues: 0 | Issues: 0

RawBoost-antispoofing

This repository includes the code to reproduce our paper "RawBoost: A Raw Data Boosting and Augmentation Method applied to Automatic Speaker Verification Anti-Spoofing".

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

solo-learn

solo-learn: a library of self-supervised methods for visual representation learning powered by PyTorch Lightning

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

sr_labs_book

The project is related to the development of labs for the ITMO Speaker Recognition Course.

Language: Jupyter Notebook | Stargazers: 0 | Issues: 0 | Issues: 0

ssl-for-slr

Collection of self-supervised models for speaker and language recognition tasks.

Language: Jupyter Notebook | Stargazers: 0 | Issues: 1 | Issues: 0

SSL_Anti-spoofing

This repository includes the code to reproduce our paper "Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation".

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

StreamingSpeakerDiarization

Demo for the paper "Overlap-aware low-latency online speaker diarization based on end-to-end local segmentation"

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

TorchSSL

A PyTorch-based library for semi-supervised learning (NeurIPS'21)

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Jupyter Notebook | License: MPL-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

TVConv

[CVPR 2022] TVConv: Efficient Translation Variant Convolution for Layout-aware Visual Processing

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

TWIST

Official code: Self-Supervised Learning by Estimating Twin Class Distribution

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

UniSpeech

UniSpeech - Large Scale Self-Supervised Learning for Speech

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

VisionXformer

Vision Xformers

Language: Python | License: CC0-1.0 | Stargazers: 0 | Issues: 0 | Issues: 0

WAEN

Wavelet Attention Embedding Networks for Video Super-Resolution (ICPR 2020) - Official Repository

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

WaveletAttention

Wavelet-Attention CNN for Image Classification

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

WaveMix

2D discrete Wavelet Transform for Image Classification

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0