Chenrui Cui's repositories
barry_speech_tools
This repository documents Barry's journey in learning deep learning for speech processing. Here, you'll find scripts and code snippets related to environment setup, data preprocessing, speech frontend, speech recognition, voice conversion, speech synthesis, and more. Let's explore the fascinating world of speech processing together!
Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
ChenruiCui.github.io
AcadHomepage: A Modern and Responsive Academic Personal Homepage
dns_mos_calculate
Code for calculating DNS_MOS.
fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
faster-whisper
Faster Whisper transcription with CTranslate2
GitHub-Chinese-Top-Charts
:cn: GitHub Chinese Top Charts: separate "Software | Resources" rankings for each language, pinpointing good Chinese-language projects. Take what you need and learn efficiently.
gss
A simple package for Guided source separation (GSS)
ICMC-ASR_Baseline
The baseline system for the ICASSP2024 ICMC-ASR Challenge.
M2UGen
This is the official repository for M2UGen
Whisper-Finetune
Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deployment, Windows desktop deployment, and Android deployment
whisper.cpp
Port of OpenAI's Whisper model in C/C++
whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
open-speech-data
A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
pyroomacoustics
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
pytorch-book
PyTorch tutorials and fun projects including neural talk, neural style, poem writing, anime generation (ใๆทฑๅบฆๅญฆไน ๆกๆถPyTorch๏ผๅ ฅ้จไธๅฎๆใ)
Retrieval-based-Voice-Conversion-WebUI
Voice data <= 10 mins can also be used to train a good VC model!
RUI_SE
The official repo of "A Refining Underlying Information Framework for Speech Enhancement"
so-vits-svc
SoftVC VITS Singing Voice Conversion
UER-py
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo