liangym's repositories

paddlespeech_tts_cpp

PaddleSpeech TTS cpp

bark

🔊 Text-prompted Generative Audio Model

Language:PythonLicense:NOASSERTIONStargazers:1Issues:0Issues:0

PaddleSpeech

Easy-to-use Speech Toolkit including SOTA ASR pipeline, influential TTS with text frontend and End-to-End Speech Simultaneous Translation.

Language:PythonLicense:Apache-2.0Stargazers:1Issues:1Issues:0

Agently

🚀 A fast way to build LLM Agent based Application 🤵 A light weight framework helps developers to create amazing LLM based applications. 🎭 You can use it to create an LLM based agent instance with role set and memory easily. ⚙️ You can use Agently agent instance just like an async function and put it anywhere in your code.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

aidatatang_200zh

Aidatatang_200zh is an open source Chinese Mandarin speech corpus released by DataTang Technology Co., Ltd (www.datatang.com).

Language:ShellStargazers:0Issues:0Issues:0

audio-SNR

Mixing an audio file with a noise file at any Signal-to-Noise Ratio (SNR)

Language:PythonStargazers:0Issues:1Issues:0

AudioLDM

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Bert-VITS2

vits2 backbone with multilingual-bert

License:AGPL-3.0Stargazers:0Issues:0Issues:0

CommonCode

Save some common code

Language:PythonStargazers:0Issues:1Issues:0

deep-clustering

deep clustering method for single-channel speech separation

Stargazers:0Issues:0Issues:0

deepcluster

Deep Clustering for Unsupervised Learning of Visual Features

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

DeepClustering

Deep Clustering

Stargazers:0Issues:1Issues:0

DeepSpeech

A TensorFlow implementation of Baidu's DeepSpeech architecture

Language:C++License:MPL-2.0Stargazers:0Issues:1Issues:0

docker-kaldi-gstreamer-server

Dockerfile for kaldi-gstreamer-server.

Language:DockerfileLicense:BSD-2-ClauseStargazers:0Issues:1Issues:0

encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

License:NOASSERTIONStargazers:0Issues:0Issues:0

FastGPT

FastGPT is a knowledge-based question answering system built on the LLM. It offers out-of-the-box data processing and model invocation capabilities. Moreover, it allows for workflow orchestration through Flow visualization, thereby enabling complex question and answer scenarios!

License:NOASSERTIONStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

HarvestText

文本挖掘和预处理工具(文本清洗、新词发现、情感分析、实体识别链接、关键词抽取、知识抽取、句法分析等),无监督或弱监督方法

License:MITStargazers:0Issues:0Issues:0

KWS_RUIM

scripts used for kws project

Language:PythonStargazers:0Issues:2Issues:0

masr

中文语音识别,提供预训练模型,高识别率 Chinese Speech Recognition; Mandarin Automatic Speech Recognition;

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

MockingBird

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

License:NOASSERTIONStargazers:0Issues:0Issues:0

NMFLibrary

MATLAB library for non-negative matrix factorization (NMF): Version 1.8.0

Language:MATLABLicense:MITStargazers:0Issues:1Issues:0

Parakeet

PAddle PARAllel text-to-speech toolKIT (supporting WaveFlow, WaveNet, Transformer TTS and Tacotron2)

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

pytorch-lightning

The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.

License:Apache-2.0Stargazers:0Issues:0Issues:0

resample

重采样 8k 变 16k 或者其他

Language:PythonStargazers:0Issues:1Issues:0

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

License:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

License:MITStargazers:0Issues:0Issues:0

Wave-U-Net

Implementation of the Wave-U-Net for audio source separation

Language:PythonLicense:MITStargazers:0Issues:0Issues:0