AaronPanXiaoFeng's starred repositories

ComfyUI-AnimateDiff-Evolved

Improved AnimateDiff for ComfyUI and Advanced Sampling Support

Language: Python | License: Apache-2.0 | Stargazers: 2637 | Issues: 0
Language: Python | License: GPL-3.0 | Stargazers: 3910 | Issues: 0

ComfyUI-Advanced-ControlNet

ControlNet scheduling and masking nodes with sliding context support

Language: Python | License: GPL-3.0 | Stargazers: 558 | Issues: 0
Language: Python | Stargazers: 137 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 134 | Issues: 0

was-node-suite-comfyui

An extensive node suite for ComfyUI with over 210 new nodes

Language: Jupyter Notebook | License: MIT | Stargazers: 1145 | Issues: 0

ComfyUI-LLaVA-Captioner

A ComfyUI extension for chatting with your images with LLaVA. Runs locally, no external services, no filter.

Language: Python | License: GPL-3.0 | Stargazers: 104 | Issues: 0

comfyui_segment_anything

Based on GroundingDino and SAM, use semantic strings to segment any element in an image. The comfyui version of sd-webui-segment-anything.

Language: Python | License: Apache-2.0 | Stargazers: 673 | Issues: 0

FreeU

FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)

License: MIT | Stargazers: 1701 | Issues: 0
Language: JavaScript | License: Apache-2.0 | Stargazers: 469 | Issues: 0

ComfyUI_Comfyroll_CustomNodes

Custom nodes for SDXL and SD1.5 including Multi-ControlNet, LoRA, Aspect Ratio, Process Switches, and many more nodes.

Language: Python | Stargazers: 626 | Issues: 0

Emu

Emu Series: Generative Multimodal Models from BAAI

Language: Python | License: Apache-2.0 | Stargazers: 1620 | Issues: 0

StreamDiffusion

StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation

Language: Python | License: Apache-2.0 | Stargazers: 9511 | Issues: 0

PIA

[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combining with Dreambooth, achieving stunning videos.

Language: Python | License: Apache-2.0 | Stargazers: 885 | Issues: 0

facenet-pytorch

Pretrained PyTorch face detection (MTCNN) and facial recognition (InceptionResnet) models

Language: Python | License: MIT | Stargazers: 4459 | Issues: 0

OpenVoice

Instant voice cloning by MIT and MyShell.

Language: Python | License: MIT | Stargazers: 28670 | Issues: 0

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language: Python | License: MIT | Stargazers: 4487 | Issues: 0

python-pinyin

Convert Chinese characters to pinyin (pypinyin)

Language: Python | License: MIT | Stargazers: 4837 | Issues: 0
Language: Python | License: GPL-3.0 | Stargazers: 64 | Issues: 0

Bert-VITS2

vits2 backbone with multilingual-bert

Language: Python | License: AGPL-3.0 | Stargazers: 7845 | Issues: 0

StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Language: Python | License: MIT | Stargazers: 4776 | Issues: 0
Language: Jupyter Notebook | Stargazers: 208 | Issues: 0
Language: Jupyter Notebook | Stargazers: 182 | Issues: 0
Language: Python | License: NOASSERTION | Stargazers: 2192 | Issues: 0

EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Language: Python | License: Apache-2.0 | Stargazers: 7275 | Issues: 0

FreeVC

FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion

Language: Python | License: MIT | Stargazers: 591 | Issues: 0

LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Language: Python | License: Apache-2.0 | Stargazers: 31734 | Issues: 0

stable-diffusion-reference-only

img2img version of stable diffusion. Anime Character Remix. Line Art Automatic Coloring. Style Transfer.

Language: Python | License: Apache-2.0 | Stargazers: 129 | Issues: 0

Bark-Voice-Cloning

Bark Voice Cloning and Voice Cloning for Chinese Speech

Language: Jupyter Notebook | License: MIT | Stargazers: 2739 | Issues: 0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Python | License: MPL-2.0 | Stargazers: 34035 | Issues: 0