Beast code in Giters

Jun Xue's starred repositories

asvspoof5

Language:Python2100

audiossl

A library built for easier audio self-supervised training, downstream tasks evaluation

Language:PythonNOASSERTION7300

PyTorch-VAE

A Collection of Variational Autoencoders (VAE) in PyTorch.

Language:PythonApache-2.0624500

STG-Mamba

Official Implementation of STG-Mamba: Spatial-Temporal Graph Learning via Selective State Space Model.

Language:Python12500

auto_avsr

Auto-AVSR: Lip-Reading Sentences Project

Language:PythonApache-2.015100

SW-WaveNet4ASD

Language:Python700

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonMIT2889500

STgram-MFN

A spectro-temporal fusion feature, STgram, with MobileFaceNet For more stable Anomalous Sound Detection

Language:Python5800

AADCL

PyTorch implementation of the paper "Semi-Supervised Acoustic Anomaly Detection via Contrastive Learning"

Language:PythonNOASSERTION1600

ocnet

Language:PythonMIT500

emotion2vec

[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

Language:Python50200

pearxwebsite

2200

GAT

Graph Attention Networks (https://arxiv.org/abs/1710.10903)

Language:PythonMIT313200

FSD-Dataset

This repository presents a subset of our proposed FSD dataset for song deepfake detection.

Language:Python1900

ConvTran

This is a PyTorch implementation of ConvTran

Language:PythonMIT10500

tdfbanks

Pytorch implementation of time-domain filterbanks

Language:PythonNOASSERTION11000

Fastaudio

FastAudio is a Learnable Audio Frontend team Magnum's designed for the ASVspoof 2021 challenge

Language:Python4100

SpeechFormer2

SpeechFormer++ in PyTorch

Language:Python3600

ScConv

SCConv: Spatial and Channel Reconstruction Convolution for Feature Redundancy

Language:Python24100

torch-cam

Class activation maps for your PyTorch models (CAM, Grad-CAM, Grad-CAM++, Smooth Grad-CAM++, Score-CAM, SS-CAM, IS-CAM, XGrad-CAM, Layer-CAM)

Language:PythonApache-2.0190300

SSL_Anti-spoofing

This repository includes the code to reproduce our paper "Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation".

Language:PythonMIT8700

w2v2-how-to

How to use our public wav2vec2 dimensional emotion model

Language:Jupyter NotebookMIT42100

RawBoost-antispoofing

This repository includes the code to reproduce our paper "RawBoost: A Raw Data Boosting and Augmentation Method applied to Automatic Speaker Verification Anti-Spoofing".

Language:PythonMIT4400

chatgpt-on-wechat

基于大模型搭建的聊天机器人，同时支持微信公众号、企业微信应用、飞书、钉钉等接入，可选择GPT3.5/GPT-4o/GPT4.0/ Claude/文心一言/讯飞星火/通义千问/ Gemini/GLM-4/Claude/Kimi/LinkAI，能处理文本、语音和图片，访问操作系统和互联网，支持基于自有知识库进行定制企业智能客服。

Language:PythonMIT2811500