Jun Xue (JunXue-tech)

JunXue-tech

Geek Repo

Company:Anhui university

Location:China

Github PK Tool:Github PK Tool

Jun Xue's starred repositories

Language:PythonStargazers:21Issues:0Issues:0

audiossl

A library built for easier audio self-supervised training, downstream tasks evaluation

Language:PythonLicense:NOASSERTIONStargazers:73Issues:0Issues:0

PyTorch-VAE

A Collection of Variational Autoencoders (VAE) in PyTorch.

Language:PythonLicense:Apache-2.0Stargazers:6245Issues:0Issues:0

STG-Mamba

Official Implementation of STG-Mamba: Spatial-Temporal Graph Learning via Selective State Space Model.

Language:PythonStargazers:125Issues:0Issues:0

auto_avsr

Auto-AVSR: Lip-Reading Sentences Project

Language:PythonLicense:Apache-2.0Stargazers:151Issues:0Issues:0
Language:PythonStargazers:7Issues:0Issues:0

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonLicense:MITStargazers:28895Issues:0Issues:0

STgram-MFN

A spectro-temporal fusion feature, STgram, with MobileFaceNet For more stable Anomalous Sound Detection

Language:PythonStargazers:58Issues:0Issues:0

AADCL

PyTorch implementation of the paper "Semi-Supervised Acoustic Anomaly Detection via Contrastive Learning"

Language:PythonLicense:NOASSERTIONStargazers:16Issues:0Issues:0
Language:PythonLicense:MITStargazers:5Issues:0Issues:0

emotion2vec

[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

Language:PythonStargazers:502Issues:0Issues:0

GAT

Graph Attention Networks (https://arxiv.org/abs/1710.10903)

Language:PythonLicense:MITStargazers:3132Issues:0Issues:0

FSD-Dataset

This repository presents a subset of our proposed FSD dataset for song deepfake detection.

Language:PythonStargazers:19Issues:0Issues:0

ConvTran

This is a PyTorch implementation of ConvTran

Language:PythonLicense:MITStargazers:105Issues:0Issues:0

tdfbanks

Pytorch implementation of time-domain filterbanks

Language:PythonLicense:NOASSERTIONStargazers:110Issues:0Issues:0

Fastaudio

FastAudio is a Learnable Audio Frontend team Magnum's designed for the ASVspoof 2021 challenge

Language:PythonStargazers:41Issues:0Issues:0

SpeechFormer2

SpeechFormer++ in PyTorch

Language:PythonStargazers:36Issues:0Issues:0

ScConv

SCConv: Spatial and Channel Reconstruction Convolution for Feature Redundancy

Language:PythonStargazers:241Issues:0Issues:0

torch-cam

Class activation maps for your PyTorch models (CAM, Grad-CAM, Grad-CAM++, Smooth Grad-CAM++, Score-CAM, SS-CAM, IS-CAM, XGrad-CAM, Layer-CAM)

Language:PythonLicense:Apache-2.0Stargazers:1903Issues:0Issues:0

SSL_Anti-spoofing

This repository includes the code to reproduce our paper "Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation".

Language:PythonLicense:MITStargazers:87Issues:0Issues:0

w2v2-how-to

How to use our public wav2vec2 dimensional emotion model

Language:Jupyter NotebookLicense:MITStargazers:421Issues:0Issues:0

RawBoost-antispoofing

This repository includes the code to reproduce our paper "RawBoost: A Raw Data Boosting and Augmentation Method applied to Automatic Speaker Verification Anti-Spoofing".

Language:PythonLicense:MITStargazers:44Issues:0Issues:0

chatgpt-on-wechat

基于大模型搭建的聊天机器人,同时支持 微信公众号、企业微信应用、飞书、钉钉 等接入,可选择GPT3.5/GPT-4o/GPT4.0/ Claude/文心一言/讯飞星火/通义千问/ Gemini/GLM-4/Claude/Kimi/LinkAI,能处理文本、语音和图片,访问操作系统和互联网,支持基于自有知识库进行定制企业智能客服。

Language:PythonLicense:MITStargazers:28115Issues:0Issues:0

aasist

Official PyTorch implementation of "AASIST: Audio Anti-Spoofing using Integrated Spectro-Temporal Graph Attention Networks"

Language:PythonLicense:MITStargazers:146Issues:0Issues:0

Self-Distillation

Improve a Model's accuracy by distilling knowledge to the earlier layers of the model. Improves accuracy and performance of lightweight DNN models

Language:Jupyter NotebookLicense:MITStargazers:6Issues:0Issues:0
Language:PythonLicense:BSD-3-ClauseStargazers:308Issues:0Issues:0

audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

Language:PythonLicense:BSD-2-ClauseStargazers:2429Issues:0Issues:0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonLicense:MITStargazers:29811Issues:0Issues:0

leaf-pytorch

PyTorch implementation of the LEAF audio frontend

Language:PythonStargazers:63Issues:0Issues:0