splinter21's repositories
dub_genius
A quick dubbing tool for video editing, based on GPT-SoVITS
animate-anything
Fine-Grained Open Domain Image Animation with Motion Guidance
AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
Auto-Convert-Music
Discover the AI Singing Software, a revolutionary application that changes how we enjoy music. Using advanced audio processing, it extracts vocals from songs, enabling AI to perform them in a unique and captivating style.
bili-scraper
Scrape data of videos and users from Bilibili
chinese-g2p
Chinese text-to-pronunciation conversion (traditional dictionary plus maximum-matching word segmentation)
clarity-upscaler
Clarity AI | AI Image Upscaler & Enhancer - free and open-source Magnific Alternative
ComfyUI-Workflows-ZHO
My ComfyUI workflows collection
DWT-FFC
Official PyTorch implementation of dehazing method based on FFC and ConvNeXt, 1st place solution of NTIRE 2023 HR NonHomogeneous Dehazing Challenge (CVPR Workshop 2023).
DynamiCrafter
DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
GPT-SoVITS
Just 1 minute of voice data is enough to train a good TTS model! (few-shot voice cloning)
imgutils
A convenient and user-friendly anime-style image data processing library that integrates various advanced anime-style image processing models
Lightvoc
LightVoc: An Upsampling-Free GAN Vocoder Based on Conformer and Inverse Short-Time Fourier Transform
megatts2
Unofficial implementation of Megatts2
MeloTTS
High-quality multilingual text-to-speech library by MyShell.ai. Supports English, Spanish, French, Chinese, Japanese, and Korean.
MoeSR
An application specializing in image super-resolution for ACGN illustrations and visual novel (Galgame) CG.
MVSEP-MDX23-Colab_v2
Colab adaptation of MVSep Model for MDX23 music separation contest
my-docs-website
AlbertZhang's documentation website
python-spider
Web scraping: from getting started to getting jailed
Resume
🌝 LaTeX resume template for software engineers: build a clean, elegant programmer resume. Star or fork it, then take it with you~
sheetsage
Transcribe music into lead sheets!
stable-speech
Reproduction of Stability AI's Text-to-Speech model.
TextrolSpeech
TextrolSpeech: A Text Style Control Speech Corpus With Codec Language Text-to-Speech Models (2024 ICASSP)
tinyvc
Lightweight voice conversion
xtts-webui
Web UI for using and fine-tuning XTTS