Beast code in Giters

Easy-to-use Speech Toolkit including SOTA/Streaming ASR with punctuation, influential TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Language:PythonApache-2.0000

Paddlespeech-Streaming-ASR-GUI

Language:PythonMIT000

porcupine

On-device wake word detection powered by deep learning

Language:PythonApache-2.0000

pykaldi

A Python wrapper for Kaldi

Language:PythonApache-2.0000

realtime-vad-sample

Sample code of real-time voice activity detection using webrtcvad.

Language:Python000

RealtimeSTT

A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription. Designed for real-time applications like voice assistants.

Language:Python000

Retrieval-based-Voice-Conversion-WebUI

Voice data <= 10 mins can also be used to train a good VC model!

Language:PythonMIT000

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Language:PythonNOASSERTION000

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Language:PythonMIT000

travel-chatbot

This project implements a travel chatbot powered by the RAG (Retrieve and Generate) chain, providing real-time information retrieval using various tools and the ability to fetch weather reports.

MIT000

vits

VITS implementation of Japanese, Chinese, Korean, Sanskrit and Thai

Language:PythonMIT000

VITS-fast-fine-tuning

This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion

Language:PythonApache-2.0000

vits_chinese

Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support streaming out!

Language:PythonMIT000

VOSk-ASR-GUI-Demo-

Language:Python000

Whisper-Finetune

微调Whisper模型和加速推理

Language:PythonApache-2.0000

whisper-finetuning

[WIP] Scripts for fine-tuning Whisper

Language:PythonMIT000

v-yunbin

wzy's repositories

AEGAN-AD

AIPC

bert4torch

Bert-VITS2-Integration-train-txt-infer

emotion-finetuning-vits

emotional-vits

FastASR

FreeVC

FunASR

Genshin_Datasets

GenshinVoice

GPT-SoVITS

icefall

MoeGoe

PaddleSpeech

Paddlespeech-Streaming-ASR-GUI

porcupine

pykaldi

realtime-vad-sample

RealtimeSTT

Retrieval-based-Voice-Conversion-WebUI

seamless_communication

silero-vad

travel-chatbot

vits

VITS-fast-fine-tuning

vits_chinese

VOSk-ASR-GUI-Demo-

Whisper-Finetune

whisper-finetuning