wangtianrui

followers

following

stars

BJTU

Beijing

Organizations

android-nuc

Tianrui Wang (王天锐)'s repositories

OldPeopleHome

:fire:智能养老院项目

Language:C63 30

HGCN

The official repo of "HGCN: Harmonic Gated Compensation Network For Speech Enhancement"

Language:Python51 2 3

APC-SNR

Implementation of "A Deep Learning Loss Function based on Auditory Power Compression for Speech Enhancement" by pytorch

Language:Python27 3 3

My-notes

:books:学习随笔

Language:JavaScript17 2 13

MindSpore4Speech

Language:Python3 10

Audio-Enhancement-via-ONMF

Language:PythonMIT1 10

DPCRN_DNS3

Implementation of paper "DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech Enhancement"

Language:Python1 10

onnx-simplifier

Simplify your onnx model

Language:PythonApache-2.01 10

versatile_audio_super_resolution

Versatile audio super resolution (any -> 48kHz) with AudioSR.

MIT100

RUI_SE

The official repo of "A Refining Underlying Information Framework for Speech Enhancement"

Language:Python000

AcademiCodec

AcademiCodec: An Open Source Audio Codec Model for Academic Research

Language:Python000

asr_labs

ASR labs

Language:Jupyter NotebookMIT010

BigVGAN

Unofficial pytorch implementation of BigVGAN: A Universal Neural Vocoder with Large-Scale Training

MIT000

conditional-flow-matching

MIT000

EnCodec_Trainer

Language:Python000

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonMIT010

paper2gui

Convert AI papers to GUI，Make it easy and convenient for everyone to use artificial intelligence technology。让每个人都简单方便的使用前沿人工智能技术

Language:Jupyter NotebookMIT010

poolformer

PoolFormer: MetaFormer is Actually What You Need for Vision

Language:Jupyter NotebookApache-2.0010

QuadTreeAttention

QuadTree Attention for Vision Transformers (ICLR2022)

Language:Jupyter Notebook010

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit.

Language:PythonApache-2.0010

SDDNet

Coarse implement of the paper "A Simultaneous Denoising and Dereverberation Framework with Target Decoupling", On DNS-2020 dataset, the DNSMOS of first stage is 3.42 and second stage is 3.47.

Language:Python010

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector, Language Classifier and Spoken Number Detector

Language:PythonMIT010

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language:PythonApache-2.0000

Uformer

Uformer: A Unet based dilated complex & real dual-path conformer network for simultaneous speech enhancement and dereverberation

Language:Python010

UniSpeech

Language:PythonNOASSERTION010

vits_chinese

vits chinese, tts chinese, tts mandarin 史上训练最简单，音质最好的语音合成系统，兼容性非常好的合成框架

Language:Python010

VoiceLDM

VoiceLDM: Text-to-Speech with Environmental Context

Apache-2.0000

voiceldm-data

Apache-2.0000

wangtianrui.github.io

resume

Language:HTML020

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Language:C++Apache-2.0010