zxiaomzxm's starred repositories
open-webui
User-friendly WebUI for AI (Formerly Ollama WebUI)
stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
InvokeAI
InvokeAI is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, supports terminal use through a CLI, and serves as the foundation for multiple commercial products.
label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
FastGPT
FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, letting you easily develop and deploy complex question-answering systems without the need for extensive setup or configuration.
ComfyUI-3D-Pack
An extensive node suite that enables ComfyUI to process 3D inputs (Mesh & UV Texture, etc) using cutting edge algorithms (3DGS, NeRF, etc.)
comflowyspace
Comflowyspace is an intuitive, user-friendly, open-source AI tool for generating images and videos, democratizing access to AI technology.
comfyui-deploy
An open source `vercel` like deployment platform for Comfy UI
MagicDrive
[ICLR24] Official implementation of the paper “MagicDrive: Street View Generation with Diverse 3D Geometry Control”
comfy-server
comfyui server to use comfyui API as easy as send a message
LaneSegNet
[ICLR 2024] Map Learning with Lane Segment for Autonomous Driving
ComfyUI-BiRefNet-ZHO
Better version for BiRefNet in ComfyUI | Both img & video
MagicDrive3D
Official implementation of the paper “MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes”