Tianrui Wang (王天锐)'s repositories
OldPeopleHome
:fire:智能养老院项目
DPCRN_DNS3
Implementation of paper "DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech Enhancement"
versatile_audio_super_resolution
Versatile audio super resolution (any -> 48kHz) with AudioSR.
RUI_SE
The official repo of "A Refining Underlying Information Framework for Speech Enhancement"
SLAM-LLM
Speech, Language, Audio, Music Processing with Large Language Model
AcademiCodec
AcademiCodec: An Open Source Audio Codec Model for Academic Research
BigVGAN
Unofficial pytorch implementation of BigVGAN: A Universal Neural Vocoder with Large-Scale Training
poolformer
PoolFormer: MetaFormer is Actually What You Need for Vision
QuadTreeAttention
QuadTree Attention for Vision Transformers (ICLR2022)
silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector, Language Classifier and Spoken Number Detector
TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
VoiceLDM
VoiceLDM: Text-to-Speech with Environmental Context
wangtianrui.github.io
resume