Tianrui Wang (王天锐)'s repositories
OldPeopleHome
:fire: Smart nursing home project
DPCRN_DNS3
Implementation of paper "DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech Enhancement"
onnx-simplifier
Simplify your ONNX model
versatile_audio_super_resolution
Versatile audio super resolution (any -> 48kHz) with AudioSR.
RUI_SE
The official repo of "A Refining Underlying Information Framework for Speech Enhancement"
AcademiCodec
AcademiCodec: An Open Source Audio Codec Model for Academic Research
BigVGAN
Unofficial PyTorch implementation of BigVGAN: A Universal Neural Vocoder with Large-Scale Training
poolformer
PoolFormer: MetaFormer is Actually What You Need for Vision
QuadTreeAttention
QuadTree Attention for Vision Transformers (ICLR2022)
silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector, Language Classifier and Spoken Number Detector
TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
vits_chinese
VITS Chinese, TTS Chinese, TTS Mandarin: the easiest-to-train, best-sounding speech synthesis system ever, and a highly compatible synthesis framework
VoiceLDM
VoiceLDM: Text-to-Speech with Environmental Context
wangtianrui.github.io
resume