AmorJNYH's repositories
StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
wetts
Production First and Production Ready End-to-End Text-to-Speech Toolkit
canoSpeech
text to speech
CoMoSpeech
CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model
RealtimeTTS
Converts text to speech in realtime by identifying sentence fragments for immediate auditory feedback. Ideal for applications requiring instant audio responses.
latent-consistency-model
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
AQUA-Tk
语音质量自动打分,AQUA-Tk = Audio QUality Assessment-Toolkit. (In development)
pythainlp
Thai Natural Language Processing in Python.
Diff-HierVC
Official Pytorch Implementation of "Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Prior for Zero-shot Speaker Adaptation"
Transformer-TTS-more
A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"
LLVC
实时变声
thumbor
thumbor is an open-source photo thumbnail service by globo.com
AnimateDiff
Official implementation of AnimateDiff.
aidea
AIdea 是一款支持 GPT 以及国产大语言模型通义千问、文心一言等,支持 Stable Diffusion 文生图、图生图、 SDXL1.0、超分辨率、图片上色的全能型 APP。
openvidu
OpenVidu Platform main repository
wavmark
AI-based Audio Watermarking Tool
supersonic
SuperSonic is an out-of-the-box yet highly extensible framework for building ChatBI
audino
Open source audio annotation tool for humans
kantts
kantts部署,TTS appalication based on modelscope KAN-TTS
openspeech
Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.
so-vits-svc-fork
实时变声 so-vits-svc fork with REALTIME support (voice changer) and greatly improved interface.
Eureka
Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models"
Rerender_A_Video
[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
UniAudio
The Open Source Code of UniAudio
Text-to-speech
TTS数据处理
HyperLips
Pytorch official implementation for our paper "HyperLips: Hyper Control Lips with High Resolution Decoder for Talking Face Generation".
EasyNLP
EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit