xmpx

xmpx's repositories

SFANC-Window

Real-time Implementation of CNN-based selective fixed-filter active noise control and effectiveness analysis using explainable AI

Language: Python

aps

My workspace for single/multi-channel speech enhancement & separation & recognition.

Language: Python, License: Apache-2.0

asteroid

The PyTorch-based audio source separation toolkit for researchers

Language: Python, License: MIT

asteroid_gan_exps

GAN experiments with Asteroid

Language: Python, License: MIT

awesome-speech-enhancement

speech enhancement / speech separation / sound source localization

License: GPL-2.0

bob-plugin-openai-translator

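A Bob plugin for text translation, text polishing, and grammar correction based on the ChatGPT API. Let's welcome together a new era that needs no Tower of Babel!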

Language: JavaScript

bottleneck-transformer-pytorch

Implementation of Bottleneck Transformer in PyTorch

License: MIT

BottleneckTransformers

Bottleneck Transformers for Visual Recognition

License: MIT

ConferencingSpeech2022

Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge in Online Conferencing Applications

License: Apache-2.0

Conformer

Official code for Conformer: Local Features Coupling Global Representations for Visual Recognition

Language: Jupyter Notebook, License: Apache-2.0

deeplearningsourceseparation

Deep Recurrent Neural Networks for Source Separation

License: NOASSERTION

DPTNet

Language: Python

EEND

End-to-End Neural Diarization

License: MIT

ElectronBot

espnet

End-to-End Speech Processing Toolkit

Language: Python, License: Apache-2.0

FRA-RIR

Language: Python, License: Apache-2.0

GC3

Language: Python, License: MIT

involution

[CVPR 2021] Involution: Inverting the Inherence of Convolution for Visual Recognition, a brand new neural operator

Language: Python, License: MIT

Knowledge-Distillation-Zoo

PyTorch implementation of various Knowledge Distillation (KD) methods.

ncnn

ncnn is a high-performance neural network inference framework optimized for the mobile platform

Language: C++, License: NOASSERTION

openspeech

Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.

Language: Python, License: MIT

pytorch-image-models

PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more

License: Apache-2.0

SoundSourceSeparation

The code for multi-channel speech enhancement and source separation such as MNMF, MNMF_DP, ILRMA, ILRMA_DP, FastMNMF, FastMNMF_DP, FCA, FastFCA

Language: Python, License: MIT

speech-separation

Language: MATLAB

speechbrain

A PyTorch-based Speech Toolkit

License: Apache-2.0

SSL-pretraining-separation

Official repository of our paper: https://arxiv.org/pdf/2010.15366.pdf

Language: Python

Swin-Transformer

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

License: MIT

TAC

transform-average-concatenate (TAC) method for end-to-end microphone permutation and number invariant ad-hoc beamforming.

Language: Python

Tutorial_Separation

This repo summarizes tutorials, datasets, papers, code, and tools for speech separation and speaker extraction tasks. Pull requests are welcome.

vae_dolphin

License: NOASSERTION