lbqin's repositories

speech-vad-demo

集成Webrtc的VAD,用于切分音频文件

Language:CStargazers:1Issues:2Issues:0

SpeechSynthesis

语音合成综述

License:Apache-2.0Stargazers:1Issues:2Issues:0

aichallenge

xunfei dialect baseline

Language:PythonStargazers:0Issues:2Issues:1
Language:PythonLicense:NOASSERTIONStargazers:0Issues:2Issues:0

cppjieba

"结巴"中文分词的C++版本

Language:C++Stargazers:0Issues:0Issues:0

darknet

Convolutional Neural Networks

Language:CLicense:NOASSERTIONStargazers:0Issues:2Issues:0

deep-speaker

Deep Speaker: an End-to-End Neural Speaker Embedding System https://arxiv.org/pdf/1705.02304.pdf

Language:PythonStargazers:0Issues:2Issues:0

GCommandsPytorch

ConvNets for Audio Recognition using Google Commands Dataset

Language:PythonStargazers:0Issues:2Issues:0

kaldi

This is now the official location of the Kaldi project.

Language:ShellLicense:NOASSERTIONStargazers:0Issues:2Issues:0

kaldi-enhan

Tools for speech enhancement based on kaldi

Language:C++Stargazers:0Issues:2Issues:0

LPCNet

Efficient neural speech synthesis

Language:CLicense:BSD-3-ClauseStargazers:0Issues:2Issues:0

mace

MACE is a deep learning inference framework optimized for mobile heterogeneous computing platforms.

Language:C++License:Apache-2.0Stargazers:0Issues:2Issues:0

marytts

MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure java

Language:JavaLicense:NOASSERTIONStargazers:0Issues:2Issues:0

merlin

This is now the official location of the Merlin project.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0

ML-KWS-for-MCU

Keyword spotting on Arm Cortex-M Microcontrollers

Language:CLicense:Apache-2.0Stargazers:0Issues:2Issues:0

MMdnn

MMdnn is a set of tools to help users inter-operate among different deep learning frameworks. E.g. model conversion and visualization. Convert models between Caffe, Keras, MXNet, Tensorflow, CNTK, PyTorch Onnx and CoreML.

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

Montreal-Forced-Aligner

Command line utility for forced alignment using Kaldi

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

MTTS

A Demo of Mandarin/Chinese TTS frontend

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

ncnn

ncnn is a high-performance neural network inference framework optimized for the mobile platform

Language:C++License:NOASSERTIONStargazers:0Issues:2Issues:0

parallel_wavenet_vocoder

Parallel WaveNet Vocoder Based on ClariNet

Language:PythonLicense:NOASSERTIONStargazers:0Issues:2Issues:0

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, speaker embedding

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

Sinsy-Remix

The HMM-Based Singing Voice Syntheis System Remix "Sinsy-r"

Language:C++License:MITStargazers:0Issues:2Issues:0
Language:PythonStargazers:0Issues:2Issues:0

tacotron-1

A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

tacotron2

Tacotron 2 - PyTorch implementation with faster-than-realtime inference

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

THULAC

An Efficient Lexical Analyzer for Chinese

Language:C++License:MITStargazers:0Issues:2Issues:0

TTS

Deep learning for Text2Speech

Language:Jupyter NotebookLicense:MPL-2.0Stargazers:0Issues:2Issues:0

VAD

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

Language:MatlabStargazers:0Issues:2Issues:0

web-speech-api

A repository for demos illustrating features of the Web Speech API. See https://developer.mozilla.org/en-US/docs/Web/API/Web_Speech_API for more details.

Language:JavaScriptLicense:CC0-1.0Stargazers:0Issues:2Issues:0

World

A high-quality speech analysis, manipulation and synthesis system

Language:C++License:NOASSERTIONStargazers:0Issues:2Issues:0