Wendy510's starred repositories

zju-icicles

Zhejiang University course guide sharing project

srs

SRS is a simple, high-efficiency, real-time video server supporting RTMP, WebRTC, HLS, HTTP-FLV, SRT, MPEG-DASH, and GB28181.

100-Days-Of-ML-Code

100-Days-Of-ML-Code, Chinese edition

Language: Jupyter Notebook | License: MIT | Stargazers: 20867 | Issues: 1089 | Issues: 55

EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Language: Python | License: Apache-2.0 | Stargazers: 6504 | Issues: 56 | Issues: 140

pandora

Pandora, a ChatGPT client that lets you breathe freely.

Language: Python | License: GPL-2.0 | Stargazers: 5438 | Issues: 30 | Issues: 255

muzic

Muzic: Music Understanding and Generation with Artificial Intelligence

Language: Python | License: MIT | Stargazers: 4281 | Issues: 75 | Issues: 166

DiffSinger

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

Language: Python | License: MIT | Stargazers: 4160 | Issues: 43 | Issues: 99

Book

:green_book: My personal book study notes and collection

Language: Python | License: MIT | Stargazers: 2545 | Issues: 35 | Issues: 1

FastSpeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

Language: Python | License: MIT | Stargazers: 1648 | Issues: 27 | Issues: 205

Speech-Emotion-Recognition

Speech emotion recognition implemented in Keras (LSTM, CNN, SVM, MLP)

Language: Python | License: MIT | Stargazers: 901 | Issues: 15 | Issues: 50

AvStackDocs

A compilation of audio and video fundamentals, with explanations of the related protocol documents

DTLN

TensorFlow 2.x implementation of the DTLN real-time speech denoising model, with TF-Lite, ONNX, and real-time audio processing support.

Language: Python | License: MIT | Stargazers: 550 | Issues: 9 | Issues: 76

python_ebook

A collection of Python-related materials

Language: HTML | Stargazers: 471 | Issues: 20 | Issues: 0

ChatLLM

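Work with LLMs easily: compatible with OpenAI and LangChain, with support for ERNIE Bot (文心一言), iFLYTEK Spark, Tencent Hunyuan, Zhipu ChatGLM, and others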

Language: Jupyter Notebook | License: MIT | Stargazers: 403 | Issues: 8 | Issues: 12

PortaSpeech

PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech

Language: Python | License: MIT | Stargazers: 328 | Issues: 20 | Issues: 29

LSTM_PIT_Speech_Separation

Two-talker speech separation with LSTM/BLSTM using the Permutation Invariant Training (PIT) method.

Language: Jupyter Notebook | Stargazers: 302 | Issues: 16 | Issues: 22

pytorch_xvectors

Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196

Language: Python | License: MIT | Stargazers: 302 | Issues: 8 | Issues: 15

Expressive-FastSpeech2

PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.

Language: Python | License: NOASSERTION | Stargazers: 262 | Issues: 3 | Issues: 20

speaker-recognition-py3

Based on MFCC and GMM (speech recognition based on MFCC and Gaussian mixture models)

Language: Python | License: Apache-2.0 | Stargazers: 245 | Issues: 12 | Issues: 13

FastSpeech2

PyTorch Implementation of FastSpeech 2: Fast and High-Quality End-to-End Text to Speech

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 217 | Issues: 10 | Issues: 12

pandora

ChatGPT Coding Unleashed! Pandora gives ChatGPT the ability to read and write files and run commands on your machine.

Language: PHP | License: MIT | Stargazers: 102 | Issues: 5 | Issues: 7

x-vector-pytorch

Implementation of the paper "Spoken Language Recognition using X-vectors" in PyTorch

Audio_Classification_using_LSTM

Classification of the Urban Sound audio dataset using an LSTM-based model.

Language: Python | License: MIT | Stargazers: 70 | Issues: 3 | Issues: 2

TDNN

PyTorch implementation of a Time Delay Neural Network (TDNN)

Language: Python | License: MIT | Stargazers: 39 | Issues: 1 | Issues: 0

Build-SE-Dataset

Build a speech enhancement dataset.

Language: Python | License: MIT | Stargazers: 25 | Issues: 2 | Issues: 1

kaldi-script

Notes from a beginner; not many so far

Stargazers: 5 | Issues: 0 | Issues: 0

youtube-lid-data

Scripts for collecting audio data from YouTube for building spoken language identification models.

Language: Python | Stargazers: 3 | Issues: 1 | Issues: 0
Language: MATLAB | Stargazers: 2 | Issues: 1 | Issues: 0