Chenrui Cui (cuichenrui2000)

cuichenrui2000

Geek Repo

Company:Tianjin University

Location:Beijing, China

Github PK Tool:Github PK Tool

Chenrui Cui's repositories

barry_speech_tools

This repository documents Barry's journey in learning deep learning for speech processing. Here, you'll find scripts and code snippets related to environment setup, data preprocessing, speech frontend, speech recognition, voice conversion, speech synthesis, and more. Let's explore the fascinating world of speech processing together! ๐Ÿš€๐Ÿš€๐Ÿš€

Language:PythonLicense:Apache-2.0Stargazers:5Issues:1Issues:0

Amphion

Amphion (/รฆmหˆfaษชษ™n/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

ChenruiCui.github.io

AcadHomepage: A Modern and Responsive Academic Personal Homepage

Language:SCSSLicense:MITStargazers:0Issues:0Issues:0

dns_mos_calculate

Code for calculate DNS_MOS.

Language:PythonStargazers:0Issues:0Issues:0

fairseqDRAFT

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

faster-whisper

Faster Whisper transcription with CTranslate2

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

GitHub-Chinese-Top-Charts

:cn: GitHubไธญๆ–‡ๆŽ’่กŒๆฆœ๏ผŒๅ„่ฏญ่จ€ๅˆ†่ฎพใ€Œ่ฝฏไปถ | ่ต„ๆ–™ใ€ๆฆœๅ•๏ผŒ็ฒพๅ‡†ๅฎšไฝไธญๆ–‡ๅฅฝ้กน็›ฎใ€‚ๅ„ๅ–ๆ‰€้œ€๏ผŒ้ซ˜ๆ•ˆๅญฆไน ใ€‚

Language:JavaLicense:NOASSERTIONStargazers:0Issues:0Issues:0

gss

A simple package for Guided source separation (GSS)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

ICMC-ASR_Baseline

The baseline system for the ICASSP2024 ICMC-ASR Challenge.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

M2UGen

This is the official repository for M2UGen

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

Whisper-Finetune

Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deployment, Windows desktop deployment, and Android deployment

Language:CLicense:Apache-2.0Stargazers:0Issues:0Issues:0

whisper.cpp

Port of OpenAI's Whisper model in C/C++

Language:CLicense:MITStargazers:0Issues:0Issues:0

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Language:PythonLicense:BSD-4-ClauseStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

open-speech-data

๐Ÿ’Ž A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

License:MITStargazers:0Issues:0Issues:0

peft

๐Ÿค— PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

License:Apache-2.0Stargazers:0Issues:0Issues:0

pyroomacoustics

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

pytorch-book

PyTorch tutorials and fun projects including neural talk, neural style, poem writing, anime generation (ใ€Šๆทฑๅบฆๅญฆไน ๆก†ๆžถPyTorch๏ผšๅ…ฅ้—จไธŽๅฎžๆˆ˜ใ€‹)

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

Retrieval-based-Voice-Conversion-WebUI

Voice data <= 10 mins can also be used to train a good VC model!

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

RUI_SE

The official repo of "A Refining Underlying Information Framework for Speech Enhancement"

Language:PythonStargazers:0Issues:0Issues:0
Language:CLicense:NOASSERTIONStargazers:0Issues:0Issues:0

so-vits-svc

SoftVC VITS Singing Voice Conversion

Language:PythonLicense:AGPL-3.0Stargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

UER-py

Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo

License:Apache-2.0Stargazers:0Issues:0Issues:0