Xelawk's starred repositories
Prompt-Engineering-Guide
🐙 Guides, papers, lectures, notebooks and resources for prompt engineering
ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Langchain-Chatchat
Langchain-Chatchat (formerly Langchain-ChatGLM): a local-knowledge-based RAG and Agent application built with Langchain and language models such as ChatGLM, Qwen, and Llama
GPT-SoVITS
One minute of voice data can be enough to train a good TTS model! (few-shot voice cloning)
generative-models
Generative Models by Stability AI
one-key-hidpi
Enable HiDPI on macOS with a native settings option.
clone-voice
A voice cloning tool with a web interface that uses your voice or any other sound to record audio
IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with an image prompt.
comfyui_controlnet_aux
ComfyUI's ControlNet Auxiliary Preprocessors
auto-subtitle
Automatically generate and overlay subtitles for any video.
HRNet-Facial-Landmark-Detection
This is an official implementation of facial landmark detection for our TPAMI paper "Deep High-Resolution Representation Learning for Visual Recognition". https://arxiv.org/abs/1908.07919
Wav2Lip-GFPGAN
High-quality lip sync
awesome-faceReenactment
Papers about face reenactment and talking face generation
ComfyUI-Stable-Video-Diffusion
ComfyUI nodes for Stable Video Diffusion
StableIdentity
🔥 StableIdentity: Inserting Anybody into Anywhere at First Sight
sd-webui-loractl
An Automatic1111 extension for dynamically controlling the weights of LoRAs during image generation
watermark-detection
Model for watermark classification implemented with PyTorch