Tianrui Wang (王天锐) (wangtianrui)

Company: BJTU

Location: Beijing


Organizations
android-nuc

Tianrui Wang (王天锐)'s repositories

OldPeopleHome

:fire: Smart nursing home project

Language: C, Stargazers: 63, Issues: 3, Issues: 0

HGCN

The official repo of "HGCN: Harmonic Gated Compensation Network For Speech Enhancement"

APC-SNR

Implementation of "A Deep Learning Loss Function based on Auditory Power Compression for Speech Enhancement" in PyTorch

My-notes

:books: Study notes

Language: Python, License: MIT, Stargazers: 1, Issues: 1, Issues: 0

DPCRN_DNS3

Implementation of the paper "DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech Enhancement"

Language: Python, Stargazers: 1, Issues: 1, Issues: 0

onnx-simplifier

Simplify your ONNX model

Language: Python, License: Apache-2.0, Stargazers: 1, Issues: 1, Issues: 0

versatile_audio_super_resolution

Versatile audio super resolution (any -> 48kHz) with AudioSR.

License: MIT, Stargazers: 1, Issues: 0, Issues: 0

RUI_SE

The official repo of "A Refining Underlying Information Framework for Speech Enhancement"

Language: Python, Stargazers: 0, Issues: 0, Issues: 0

AcademiCodec

AcademiCodec: An Open Source Audio Codec Model for Academic Research

Language: Python, Stargazers: 0, Issues: 0, Issues: 0

asr_labs

ASR labs

Language: Jupyter Notebook, License: MIT, Stargazers: 0, Issues: 1, Issues: 0

BigVGAN

Unofficial PyTorch implementation of BigVGAN: A Universal Neural Vocoder with Large-Scale Training

License: MIT, Stargazers: 0, Issues: 0, Issues: 0
License: MIT, Stargazers: 0, Issues: 0, Issues: 0
Language: Python, Stargazers: 0, Issues: 0, Issues: 0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language: Python, License: MIT, Stargazers: 0, Issues: 1, Issues: 0

paper2gui

Convert AI papers to GUI. Make it easy and convenient for everyone to use cutting-edge artificial intelligence technology.

Language: Jupyter Notebook, License: MIT, Stargazers: 0, Issues: 1, Issues: 0

poolformer

PoolFormer: MetaFormer is Actually What You Need for Vision

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 0, Issues: 1, Issues: 0

QuadTreeAttention

QuadTree Attention for Vision Transformers (ICLR2022)

Language: Jupyter Notebook, Stargazers: 0, Issues: 1, Issues: 0

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit.

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 1, Issues: 0

SDDNet

Rough implementation of the paper "A Simultaneous Denoising and Dereverberation Framework with Target Decoupling". On the DNS-2020 dataset, the DNSMOS of the first stage is 3.42 and that of the second stage is 3.47.

Language: Python, Stargazers: 0, Issues: 1, Issues: 0

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector, Language Classifier and Spoken Number Detector

Language: Python, License: MIT, Stargazers: 0, Issues: 1, Issues: 0

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

Uformer

Uformer: A Unet based dilated complex & real dual-path conformer network for simultaneous speech enhancement and dereverberation

Language: Python, Stargazers: 0, Issues: 1, Issues: 0
Language: Python, License: NOASSERTION, Stargazers: 0, Issues: 1, Issues: 0

vits_chinese

VITS Chinese, TTS Chinese, TTS Mandarin: the easiest-to-train, best-sounding speech synthesis system ever, and a highly compatible synthesis framework

Language: Python, Stargazers: 0, Issues: 1, Issues: 0

VoiceLDM

VoiceLDM: Text-to-Speech with Environmental Context

License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0
License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0
Language: HTML, Stargazers: 0, Issues: 2, Issues: 0

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Language: C++, License: Apache-2.0, Stargazers: 0, Issues: 1, Issues: 0