Wendy510

followers

following

stars

Wendy510's starred repositories

zju-icicles

浙江大学课程攻略共享计划

Language:HTML36291 1103 74

srs

SRS is a simple, high-efficiency, real-time video server supporting RTMP, WebRTC, HLS, HTTP-FLV, SRT, MPEG-DASH, and GB28181.

Language:C++MIT24472 846 1435

100-Days-Of-ML-Code

100-Days-Of-ML-Code中文版

Language:Jupyter NotebookMIT20867 1089 55

EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Language:PythonApache-2.06504 56 140

pandora

潘多拉，一个让你呼吸顺畅的ChatGPT。Pandora, a ChatGPT client that lets you breathe freely.

Language:PythonGPL-2.05438 30 255

muzic

Muzic: Music Understanding and Generation with Artificial Intelligence

Language:PythonMIT4281 75 166

DiffSinger

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

Language:PythonMIT4160 43 99

Book

:green_book:我的个人书籍学习和收藏

Language:PythonMIT2545 35 1

FastSpeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

Language:PythonMIT1648 27 205

Speech-Emotion-Recognition

Speech emotion recognition implemented in Keras (LSTM, CNN, SVM, MLP) | 语音情感识别

Language:PythonMIT901 15 50

AvStackDocs

音视频基础知识整理和相关协议文档说明

Language:HTML745 47 2

DTLN

Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.

Language:PythonMIT550 9 76

python_ebook

收集了一些Python相关资料

Language:HTML471 200

ChatLLM

轻松玩转LLM兼容openai&langchain，支持文心一言、讯飞星火、腾讯混元、智谱ChatGLM等

Language:Jupyter NotebookMIT403 8 12

PortaSpeech

PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech

Language:PythonMIT328 20 29

LSTM_PIT_Speech_Separation

Two-talker Speech Separation with LSTM/BLSTM by Permutation Invariant Training method.

Language:Jupyter Notebook302 16 22

pytorch_xvectors

Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196

Language:PythonMIT302 8 15

Expressive-FastSpeech2

PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.

Language:PythonNOASSERTION262 3 20

speaker-recognition-py3

Base on MFCC and GMM(基于MFCC和高斯混合模型的语音识别)

Language:PythonApache-2.0245 12 13

FastSpeech2

PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech

Language:Jupyter NotebookApache-2.0217 10 12

pandora

ChatGPT Coding Unleashed! Pandora gives ChatGPT the ability to read and write files and run commands on your machine.

Language:PHPMIT102 5 7

x-vector-pytorch

Implementation of the paper "Spoken Language Recognition using X-vectors" in Pytorch

Language:Python96 5 10

Audio_Classification_using_LSTM

Classification of Urban Sound Audio Dataset using LSTM-based model.

Language:PythonMIT70 3 2

TDNN

PyTorch implementation of a Time Delay Neural Network (TDNN)

Language:PythonMIT39 10

Build-SE-Dataset

Build speech enhancement dataset.

Language:PythonMIT25 2 1

speech_signal_processing

Language:Python16 3 2

kaldi-script

初学者笔记不多

500

youtube-lid-data

Scripts for collecting audio data from Youtube for building spoken language identification models.

Language:Python3 10

Matlab

Language:MATLAB2 10

MFCC-Analysis-and-K-means-Clustering

Language:MATLAB200