Beast code in Giters

Tianrui Wang (王天锐)'s repositories

OldPeopleHome

:fire:智能养老院项目

Language:C70 30

HGCN

The official repo of "HGCN: Harmonic Gated Compensation Network For Speech Enhancement"

Language:Python54 2 3

APC-SNR

Implementation of "A Deep Learning Loss Function based on Auditory Power Compression for Speech Enhancement" by pytorch

Language:Python28 3 3

PM-EVC

This is the official implement of A Controllable Emotion Voice Conversion Framework with Pre-trained Speech Representations

Language:Python25 20

My-notes

:books:学习随笔

Language:JavaScript18 2 13

ProgRE

Language:Python17 10

MindSpore4Speech

Language:Python3 10

DPCRN_DNS3

Implementation of paper "DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech Enhancement"

Language:Python1 10

FAcodec

Training code for FAcodec presented in NaturalSpeech3

Language:Python100

versatile_audio_super_resolution

Versatile audio super resolution (any -> 48kHz) with AudioSR.

Language:PythonMIT100

RUI_SE

The official repo of "A Refining Underlying Information Framework for Speech Enhancement"

Language:Python000

SLAM-LLM

Speech, Language, Audio, Music Processing with Large Language Model

Language:Python000

AcademiCodec

AcademiCodec: An Open Source Audio Codec Model for Academic Research

Language:Python000

asr_labs

ASR labs

Language:Jupyter NotebookMIT010

BigVGAN

Unofficial pytorch implementation of BigVGAN: A Universal Neural Vocoder with Large-Scale Training

Language:PythonMIT000

conditional-flow-matching

Language:PythonMIT000

EnCodec_Trainer

Language:Python000

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonMIT010

paper2gui

Convert AI papers to GUI，Make it easy and convenient for everyone to use artificial intelligence technology。让每个人都简单方便的使用前沿人工智能技术

Language:Jupyter NotebookMIT010

poolformer

PoolFormer: MetaFormer is Actually What You Need for Vision

Language:Jupyter NotebookApache-2.0010

QuadTreeAttention

QuadTree Attention for Vision Transformers (ICLR2022)

Language:Jupyter Notebook010

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit.

Language:PythonApache-2.0010

seed-tts-eval

Language:Python000

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector, Language Classifier and Spoken Number Detector

Language:PythonMIT010

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language:PythonApache-2.0000

Uformer

Uformer: A Unet based dilated complex & real dual-path conformer network for simultaneous speech enhancement and dereverberation

Language:Python010

UniSpeech

Language:PythonNOASSERTION010

VoiceLDM

VoiceLDM: Text-to-Speech with Environmental Context

Language:PythonApache-2.0000

voiceldm-data

Apache-2.0000

wangtianrui.github.io

resume

Language:HTML020