Di Wu (whiteshirt0429)

Location: Beijing

Di Wu's starred repositories

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language: Python · License: MIT · Stargazers: 4442 · Issues: 0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Python · License: MPL-2.0 · Stargazers: 33325 · Issues: 0

Language: Python · Stargazers: 923 · Issues: 0

Speech-Resources

Speech-related labs, companies, resources, internships, and more; recommendations and self-nominations are welcome.

Stargazers: 482 · Issues: 0

open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

License: MIT · Stargazers: 1263 · Issues: 0

AcademiCodec

AcademiCodec: An Open Source Audio Codec Model for Academic Research

Language: Python · Stargazers: 559 · Issues: 0

AudioLDM2

Text-to-Audio/Music Generation

Language: Python · License: NOASSERTION · Stargazers: 2227 · Issues: 0

sound_generation

Code and generated sounds for "Conditional Sound Generation Using Neural Discrete Time-Frequency Representation Learning", MLSP 2021

Language: Python · Stargazers: 68 · Issues: 0

k2

FSA/FST algorithms, differentiable, with PyTorch compatibility.

Language: Cuda · License: Apache-2.0 · Stargazers: 1107 · Issues: 0

leetcode-master

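《代码随想录》 LeetCode problem-solving guide: a recommended order for 200 classic problems, about 600,000 characters of detailed illustrated explanations, video walkthroughs of the difficult points, and more than 50 mind maps, with versions in C++, Java, Python, Go, JavaScript, and other languages. Never feel lost learning algorithms again! 🔥🔥 Take a look and you will wish you had found it sooner! 🚀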

Language: Shell · Stargazers: 50432 · Issues: 0

hello-algo

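《Hello 算法》 (Hello Algo): a data structures and algorithms tutorial with animated illustrations and one-click runnable code. Supports Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, and Dart. The Simplified and Traditional Chinese editions are updated in sync; the English version is in progress.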

Language: Java · License: NOASSERTION · Stargazers: 94742 · Issues: 0

wekws

Production First and Production Ready End-to-End Keyword Spotting Toolkit

Language: Python · License: Apache-2.0 · Stargazers: 435 · Issues: 0

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Language: Python · License: NOASSERTION · Stargazers: 5855 · Issues: 0

Language: PowerShell · License: Apache-2.0 · Stargazers: 2 · Issues: 0

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Language: Python · License: Apache-2.0 · Stargazers: 4069 · Issues: 0

Language: Python · Stargazers: 13 · Issues: 0

wecut

AI-powered video cutting

Stargazers: 25 · Issues: 0

WeSpeech-AI

Open Source Speech/Text Data on AI

Stargazers: 18 · Issues: 0

WeTextProcessing

Text Normalization & Inverse Text Normalization

Language: Python · License: Apache-2.0 · Stargazers: 446 · Issues: 0

Leaderboard

SpeechIO Leaderboard: a large, robust, and comprehensive benchmarking platform for Automatic Speech Recognition.

Language: Python · Stargazers: 427 · Issues: 0

SummaryOfLoanSuspension

A nationwide summary of loan-suspension notices, by province and city

Language: HTML · Stargazers: 20354 · Issues: 0

Language: Python · License: NOASSERTION · Stargazers: 64 · Issues: 0

kaldifeat

Kaldi-compatible online and offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd. Provides C++ and Python APIs.

Language: C++ · License: NOASSERTION · Stargazers: 186 · Issues: 0

3m-asr

3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition

Language: Python · License: Apache-2.0 · Stargazers: 119 · Issues: 0

vnpy

A Python-based open-source framework for developing quantitative trading platforms

Language: Python · License: MIT · Stargazers: 24441 · Issues: 0

opencpop

Opencpop: A High-Quality Open Source Chinese Popular Song Database for Singing Voice Synthesis

Stargazers: 208 · Issues: 0

chinese_text_normalization

Chinese text normalization for speech processing

Language: Python · License: MIT · Stargazers: 620 · Issues: 0

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Language: Python · License: Apache-2.0 · Stargazers: 2207 · Issues: 0

Stargazers: 1 · Issues: 0

GigaSpeech

Large, modern dataset for speech recognition

Language: Shell · License: Apache-2.0 · Stargazers: 625 · Issues: 0