wzy's repositories

AEGAN-AD

Official pytorch implementation of AEGAN-AD

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

bert4torch

pytorch implement of transformers refer to bert4keras

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Bert-VITS2-Integration-train-txt-infer

适配windows的requirements.txt,加了个长文本分段推理和手机听书的api,非本专业,屎山代码

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0
Language:MakefileLicense:MITStargazers:0Issues:0Issues:0

emotional-vits

无需情感标注的情感可控语音合成模型,基于VITS

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

FastASR

这是一个用C++实现ASR推理的项目,它依赖很少,安装也很简单,推理速度很快,在树莓派4B等ARM平台也可以流畅的运行。 推理模型是基于目前最先进的conformer模型,使用10000+小时的wenetspeech数据集训练得到, 所以识别效果也很好,可以媲美许多商用的ASR软件。

Language:CLicense:Apache-2.0Stargazers:0Issues:0Issues:0

FreeVC

FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

FunASR

A Fundamental End-to-End Speech Recognition Toolkit

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

Genshin_Datasets

Genshin Datasets For SVC/SVS/TTS

Stargazers:0Issues:0Issues:0

GenshinVoice

Voice dataset of Genshin Impact 原神语音数据集

Stargazers:0Issues:0Issues:0

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

MoeGoe

Executable file for VITS inference

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

PaddleSpeech

Easy-to-use Speech Toolkit including SOTA/Streaming ASR with punctuation, influential TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

porcupine

On-device wake word detection powered by deep learning

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

pykaldi

A Python wrapper for Kaldi

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

realtime-vad-sample

Sample code of real-time voice activity detection using webrtcvad.

Language:PythonStargazers:0Issues:0Issues:0

RealtimeSTT

A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription. Designed for real-time applications like voice assistants.

Language:PythonStargazers:0Issues:0Issues:0

Retrieval-based-Voice-Conversion-WebUI

Voice data <= 10 mins can also be used to train a good VC model!

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

travel-chatbot

This project implements a travel chatbot powered by the RAG (Retrieve and Generate) chain, providing real-time information retrieval using various tools and the ability to fetch weather reports.

License:MITStargazers:0Issues:0Issues:0

vits

VITS implementation of Japanese, Chinese, Korean, Sanskrit and Thai

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

VITS-fast-fine-tuning

This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

vits_chinese

Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support streaming out!

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

Whisper-Finetune

微调Whisper模型和加速推理

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

whisper-finetuning

[WIP] Scripts for fine-tuning Whisper

Language:PythonLicense:MITStargazers:0Issues:0Issues:0