lxz's repositories
stable-diffusion-webui
Stable Diffusion web UI
tacotronv2_wavernn_chinese
Chinese speech synthesis with Tacotron V2 + WaveRNN (TensorFlow + PyTorch)
audiocraft_plus
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
bark-training-cloning
for training the model
carefree-creator
An AI-powered creator for everyone.
DiffSinger
PyTorch Implementation of DiffSinger: Diffusion Acoustic Model for Singing Voice Synthesis (TTS Extension)
DiffSinger-1
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Forked and maintained by the OpenVPI community
disable-flutter-tls-verification
A Frida script that disables Flutter's TLS verification
dream-textures
Stable Diffusion built into the Blender shader editor
facechain
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
Games
Home Page Link:
lobe-chat
🤖 Lobe Chat - an open-source, high-performance chatbot framework that supports speech synthesis, multimodal, and extensible Function Call plugin system. Supports one-click free deployment of your private ChatGPT/LLM web application.
MDM
MDM
midi-js-soundfonts
Pre-rendered General MIDI soundfonts that can be used immediately with MIDI.js
muzic
Muzic: Music Understanding and Generation with Artificial Intelligence
OpenVoice
Instant voice cloning by MyShell.
PaddleSpeech
Easy-to-use Speech Toolkit including SOTA/Streaming ASR with punctuation, influential TTS with text frontend, Speaker Verification System and End-to-End Speech Simultaneous Translation.
ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with PyTorch
ppg-vc
PPG-Based Voice Conversion
roop
one-click deepfake (face swap)
segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
singing_transcription_ICASSP2021
The source code and pre-trained model of the paper "On the Preparation and Validation of a Large-scale Dataset of Singing Transcription"
so-vits-svc
SoftVC VITS Singing Voice Conversion
UniAudio
The Open Source Code of UniAudio
unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
vits
VITS implementation for Japanese, Chinese, Korean, Sanskrit and Thai
wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit