AI Lab's repositories

ComfyUI-RMBG

A ComfyUI custom node designed for advanced image background removal and object, face, clothes, and fashion segmentation, utilizing multiple models including RMBG-2.0, INSPYRENET, BEN, BEN2, BiRefNet, SDMatte, SAM, SAM2 and GroundingDINO.

Language:PythonLicense:GPL-3.0Stargazers:1463Issues:6Issues:110

ComfyUI-OmniGen

ComfyUI-OmniGen - A ComfyUI custom node implementation of OmniGen, a powerful text-to-image generation and editing model.

Language:PythonLicense:MITStargazers:294Issues:3Issues:42

ComfyUI-QwenVL

ComfyUI-QwenVL custom node integrates the Qwen-VL series, including the latest Qwen3-VL models, including Qwen2.5-VL and the latest Qwen3-VL, to enable advanced multimodal AI for text generation, image understanding, and video analysis.

Language:PythonLicense:GPL-3.0Stargazers:269Issues:0Issues:0

ComfyUI-JoyCaption

Joy Caption is a ComfyUI node using the LLaVA model to generate stylized image captions, supporting batch processing and GGUF models.

Language:PythonLicense:GPL-3.0Stargazers:154Issues:2Issues:27

ComfyUI-MiniCPM

A custom ComfyUI node for MiniCPM vision-language models, supporting v4, v4.5, and v4 GGUF formats, enabling high-quality image captioning and visual analysis.

Language:PythonLicense:GPL-3.0Stargazers:126Issues:1Issues:9

ComfyUI-SparkTTS

ComfyUI-SparkTTS is a custom ComfyUI node implementation of SparkTTS, an advanced text-to-speech system that harnesses the power of large language models (LLMs) to generate highly accurate and natural-sounding speech.

Language:PythonLicense:GPL-3.0Stargazers:118Issues:1Issues:5

ComfyUI-LBM

A ComfyUI custom node for Latent Bridge Matching (LBM), for fast image relighting processing.

Language:PythonLicense:GPL-3.0Stargazers:80Issues:0Issues:0

ComfyUI-MiniMax-Remover

ComfyUI-MiniMax-Remover is a custom node for ComfyUI that enables fast and efficient object removal using minimax optimization. It works in two stages: first, it trains a remover with a simplified DiT model; then it distills a robust version using CFG guidance and fewer inference steps.

Language:PythonLicense:GPL-3.0Stargazers:71Issues:0Issues:2

ComfyUI-ReduxFineTune

ComfyUI-ReduxFineTune is a custom node for ComfyUI that enables advanced style fine-tuning using the Flux Redux approach. It offers multiple unified fusion modes for precise and consistent control over style transfer, allowing users to fine-tune image styles with high flexibility and detail.

Language:PythonLicense:GPL-3.0Stargazers:65Issues:3Issues:5

ComfyUI-WildPromptor

WildPromptor simplifies prompt creation, organization, and customization in ComfyUI, turning chaotic workflows into an efficient, intuitive process.

Language:PythonLicense:Apache-2.0Stargazers:57Issues:1Issues:1

ComfyUI-EdgeTTS

ComfyUI-EdgeTTS is a powerful text-to-speech node for ComfyUI, leveraging Microsoft's Edge TTS capabilities. It enables seamless conversion of text into natural-sounding speech, supporting multiple languages and voices. Ideal for enhancing user interactions, this node is easy to integrate and customize, making it perfect for various applications.

Language:PythonLicense:GPL-3.0Stargazers:56Issues:1Issues:9

ComfyUI-FlashVSR

Powerful ComfyUI custom node built on the FlashVSR model, facilitating real-time diffusion-based video super-resolution for streaming applications.

Language:PythonLicense:GPL-3.0Stargazers:52Issues:0Issues:0

ComfyUI-MegaTTS

A ComfyUI custom node based on ByteDance MegaTTS3, enabling high-quality text-to-speech synthesis with voice cloning capabilities for both Chinese and English.

Language:PythonLicense:GPL-3.0Stargazers:49Issues:1Issues:10

ComfyUI-Pollinations

pollinations API AI Generations

Language:PythonLicense:GPL-3.0Stargazers:45Issues:1Issues:5

ComfyUI-FireRedTTS

A ComfyUI integration for FireRedTTS‑2, a real-time multi-speaker TTS system enabling high-quality, emotionally expressive dialogue and monologue synthesis. Leveraging a streaming architecture and context-aware prosody modeling, it supports natural speaker turns and stable long-form generation, ideal for interactive chat and podcast applications.

Language:PythonLicense:GPL-3.0Stargazers:40Issues:1Issues:3

Gemini-APPs

showcase of creative and useful applications built with the Google Gemini API.

Language:HTMLLicense:GPL-3.0Stargazers:17Issues:0Issues:0

ComfyUI-KokoroTTS

ComfyUI-KokoroTTS: A text-to-speech model that utilizes the Kokoro TTS framework to convert text into natural-sounding speech. It supports multiple voices and languages

Language:PythonLicense:AGPL-3.0Stargazers:13Issues:1Issues:0

ComfyUI-VoxCPMTTS

A clean, efficient ComfyUI custom node for VoxCPM TTS (Text-to-Speech) functionality. This implementation provides high-quality speech generation and voice cloning capabilities using the VoxCPM model.

Language:PythonLicense:GPL-3.0Stargazers:11Issues:0Issues:0

ComfyUI-Blip

A lightweight and high-speed ComfyUI custom node for generating image captions using BLIP models. Optimized for both GPU and CPU environments to deliver fast and efficient caption generation.

Language:PythonLicense:GPL-3.0Stargazers:9Issues:0Issues:2

ComfyUI-DeepSeek-OCR

A powerful OCR node for ComfyUI that integrates the DeepSeek-OCR model from Hugging Face.

License:GPL-3.0Stargazers:9Issues:0Issues:0

Safetensors-Converter

A robust and comprehensive Python tool to convert various AI model formats to `.safetensors` format with advanced error handling and validation.

Language:PythonLicense:Apache-2.0Stargazers:4Issues:0Issues:0
Language:JavaScriptLicense:GPL-3.0Stargazers:2Issues:0Issues:0

ComfyUI-Manager

ComfyUI-Manager is an extension designed to enhance the usability of ComfyUI. It offers management functions to install, remove, disable, and enable various custom nodes of ComfyUI. Furthermore, this extension provides a hub feature and convenience functions to access a wide range of information within ComfyUI.

Language:PythonLicense:GPL-3.0Stargazers:2Issues:0Issues:0
License:GPL-3.0Stargazers:2Issues:0Issues:0
Stargazers:1Issues:0Issues:0
Language:PythonLicense:GPL-3.0Stargazers:1Issues:0Issues:0
Stargazers:0Issues:0Issues:0
License:GPL-3.0Stargazers:0Issues:0Issues:0