cuichenrui2000

Chenrui Cui's repositories

barry_speech_tools

This repository documents Barry's journey in learning deep learning for speech processing. Here, you'll find scripts and code snippets related to environment setup, data preprocessing, speech frontend, speech recognition, voice conversion, speech synthesis, and more. Let's explore the fascinating world of speech processing together! 🚀🚀🚀

Language:PythonApache-2.05 10

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language:PythonMIT000

ChenruiCui.github.io

AcadHomepage: A Modern and Responsive Academic Personal Homepage

Language:SCSSMIT000

dns_mos_calculate

Code for calculate DNS_MOS.

Language:Python000

fairseqDRAFT

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonMIT000

faster-whisper

Faster Whisper transcription with CTranslate2

Language:PythonMIT000

GitHub-Chinese-Top-Charts

:cn: GitHub中文排行榜，各语言分设「软件 | 资料」榜单，精准定位中文好项目。各取所需，高效学习。

Language:JavaNOASSERTION000

gss

A simple package for Guided source separation (GSS)

Language:PythonMIT000

ICMC-ASR_Baseline

The baseline system for the ICASSP2024 ICMC-ASR Challenge.

Language:PythonApache-2.0000

M2UGen

This is the official repository for M2UGen

Language:Jupyter NotebookMIT000

Whisper-Finetune

Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deployment, Windows desktop deployment, and Android deployment

Language:CApache-2.0000

whisper.cpp

Port of OpenAI's Whisper model in C/C++

Language:CMIT000

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Language:PythonBSD-4-Clause000

MindSpore4Speech

000

open-speech-data

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

MIT000

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Apache-2.0000

pyroomacoustics

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

Language:PythonMIT000