Queen_Wcy's repositories

espnet_tts_frontend

Text frontend for ESPnet tts recipes

Language:PythonStargazers:1Issues:0Issues:0

ai-deployment

关注AI模型上线、模型部署

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

Computer-VisionandAudio-Lab

2018秋哈工大视听觉实验

Language:PythonStargazers:0Issues:0Issues:0

Crystal

Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

CycleGAN-VC2

Voice Conversion by CycleGAN (语音克隆/语音转换)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:HTMLStargazers:0Issues:0Issues:0

diffwave

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

License:Apache-2.0Stargazers:0Issues:0Issues:0

HiFi-GAN

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

label-studio

Label Studio is a multi-type data labeling and annotation tool with standardized output format

Language:JavaScriptLicense:Apache-2.0Stargazers:0Issues:0Issues:0

langid.py

Stand-alone language identification system

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

LeetCodeAnimation

Demonstrate all the questions on LeetCode in the form of animation.(用动画的形式呈现解LeetCode题目的思路)

Language:JavaStargazers:0Issues:0Issues:0

line_profiler

(OLD REPO) Line-by-line profiling for Python - Current repo ->

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

MS-Tacotron2

Tacotron2 based multi-speaker text to speech

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

papers-with-annotations

Research papers with annotations, illustrations and explanations

License:MITStargazers:0Issues:0Issues:0

PPSpeech

PPSpeech: Phrase based Parallel End-to-End TTS System

Stargazers:0Issues:0Issues:0

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, speaker embedding

License:MITStargazers:0Issues:0Issues:0

Shenlan-ASR-Course

深蓝学院语音课程《语音识别从入门到精通》课程作业

Stargazers:0Issues:0Issues:0

speech-synthesis-paper

List of speech synthesis papers.

License:MITStargazers:0Issues:0Issues:0
Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

SqueezeFlow

Code Repository for "SqueezeFlow: Adaptive Text-to-Speech in Low Computational Resource Scenarios"

License:Apache-2.0Stargazers:0Issues:0Issues:0

tacotron2

Forked from NVIDIA/tacotron2 and merged with Rayhane-mamah/Tacotron-2

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

Tacotron2_batch_inference

Pytorch tacotron2 that can be used to perform batch inference

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

TensorFlowTTS

:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, Korean, Chinese and Easy to adapt for other languages)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Voice-synthesis

This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) with a vocoder that works in real-time. SV2TTS is a three-stage deep learning framework that allows to create a numerical representation of a voice from a few seconds of audio, and to use it to condition a text-to-speech model trained to generalize to new voices.

Language:PythonStargazers:0Issues:0Issues:0

WavAugment

A library for speech data augmentation in time-domain

License:MITStargazers:0Issues:0Issues:0

wavegrad

A fast, high-quality neural vocoder.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

zhvoice

Chinese voice corpus. 中文语音语料,语音更加清晰自然,包含8个开源数据集,3200个说话人,900小时语音,1300万字。

Stargazers:0Issues:0Issues:0