AaronPanXiaoFeng

followers

following

stars

AaronPanXiaoFeng's starred repositories

ComfyUI-AnimateDiff-Evolved

Improved AnimateDiff for ComfyUI and Advanced Sampling Support

Language:PythonApache-2.0263700

ComfyUI_IPAdapter_plus

Language:PythonGPL-3.0391000

ComfyUI-Advanced-ControlNet

ControlNet scheduling and masking nodes with sliding context support

Language:PythonGPL-3.055800

comfyui-art-venture

Language:Python13700

ComfyUI-MotionCtrl

Language:PythonApache-2.013400

was-node-suite-comfyui

An extensive node suite for ComfyUI with over 210 new nodes

Language:Jupyter NotebookMIT114500

ComfyUI-LLaVA-Captioner

A ComfyUI extension for chatting with your images with LLaVA. Runs locally, no external services, no filter.

Language:PythonGPL-3.010400

comfyui_segment_anything

Based on GroundingDino and SAM, use semantic strings to segment any element in an image. The comfyui version of sd-webui-segment-anything.

Language:PythonApache-2.067300

FreeU

FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)

MIT170100

cg-use-everywhere

Language:JavaScriptApache-2.046900

ComfyUI_Comfyroll_CustomNodes

Custom nodes for SDXL and SD1.5 including Multi-ControlNet, LoRA, Aspect Ratio, Process Switches, and many more nodes.

Language:Python62600

Emu

Emu Series: Generative Multimodal Models from BAAI

Language:PythonApache-2.0162000

StreamDiffusion

StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation

Language:PythonApache-2.0951100

PIA

[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combing with Dreambooth, achieving stunning videos. PIA，你的个性化图像动画生成器，利用文本提示将图像变为奇妙的动画

Language:PythonApache-2.088500

facenet-pytorch

Pretrained Pytorch face detection (MTCNN) and facial recognition (InceptionResnet) models

Language:PythonMIT445900

OpenVoice

Instant voice cloning by MIT and MyShell.

Language:PythonMIT2867000

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language:PythonMIT448700

python-pinyin

汉字转拼音(pypinyin)

Language:PythonMIT483700

StyleTTS2

Language:PythonGPL-3.06400

Bert-VITS2

vits2 backbone with multilingual-bert

Language:PythonAGPL-3.0784500

StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Language:PythonMIT477600

stable-video-diffusion-colab

Language:Jupyter Notebook20800

latent-consistency-model-colab

Language:Jupyter Notebook18200

deforum-stable-diffusion

Language:PythonNOASSERTION219200

EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Language:PythonApache-2.0727500

FreeVC

FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion

Language:PythonMIT59100

LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Language:PythonApache-2.03173400

stable-diffusion-reference-only

img2img version of stable diffusion. Anime Character Remix. Line Art Automatic Coloring. Style Transfer.

Language:PythonApache-2.012900

Bark-Voice-Cloning

Bark Voice Cloning and Voice Cloning for Chinese Speech

Language:Jupyter NotebookMIT273900

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonMPL-2.03403500