liangym's repositories
paddlespeech_tts_cpp
PaddleSpeech TTS cpp
PaddleSpeech
Easy-to-use Speech Toolkit including SOTA ASR pipeline, influential TTS with text frontend and End-to-End Speech Simultaneous Translation.
Agently
🚀 A fast way to build LLM Agent based Application 🤵 A light weight framework helps developers to create amazing LLM based applications. 🎭 You can use it to create an LLM based agent instance with role set and memory easily. ⚙️ You can use Agently agent instance just like an async function and put it anywhere in your code.
aidatatang_200zh
Aidatatang_200zh is an open source Chinese Mandarin speech corpus released by DataTang Technology Co., Ltd (www.datatang.com).
AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
modelscope
ModelScope: bring the notion of Model-as-a-Service to life.
Bert-VITS2
vits2 backbone with multilingual-bert
CommonCode
Save some common code
deep-clustering
deep clustering method for single-channel speech separation
deepcluster
Deep Clustering for Unsupervised Learning of Visual Features
DeepClustering
Deep Clustering
DeepSpeech
A TensorFlow implementation of Baidu's DeepSpeech architecture
docker-kaldi-gstreamer-server
Dockerfile for kaldi-gstreamer-server.
encodec
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
FastGPT
FastGPT is a knowledge-based question answering system built on the LLM. It offers out-of-the-box data processing and model invocation capabilities. Moreover, it allows for workflow orchestration through Flow visualization, thereby enabling complex question and answer scenarios!
HarvestText
文本挖掘和预处理工具(文本清洗、新词发现、情感分析、实体识别链接、关键词抽取、知识抽取、句法分析等),无监督或弱监督方法
MockingBird
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
NMFLibrary
MATLAB library for non-negative matrix factorization (NMF): Version 1.8.0
pytorch-lightning
The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.
s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Wave-U-Net
Implementation of the Wave-U-Net for audio source separation