王文锋's starred repositories
llama_index
LlamaIndex is a data framework for your LLM applications
whisper.cpp
Port of OpenAI's Whisper model in C/C++
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
facefusion
Industry leading face manipulation platform
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Auto-Photoshop-StableDiffusion-Plugin
A user-friendly plug-in that makes it easy to generate stable diffusion images inside Photoshop using either Automatic or ComfyUI as a backend.
video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
OOTDiffusion
Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
Realtime_Multi-Person_Pose_Estimation
Code repo for realtime multi-person pose estimation in CVPR'17 (Oral)
sd-forge-layerdiffuse
[WIP] Layer Diffusion for WebUI (via Forge)
LayerDiffuse
Transparent Image Layer Diffusion using Latent Transparency
ComfyUI-layerdiffuse
Layer Diffuse custom nodes
MagicClothing
Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesis
ComfyUI-InstantID
Unofficial implementation of InstantID for ComfyUI
trt_pose_hand
Real-time hand pose estimation and gesture classification using TensorRT
human-action-recognition
Multi Person Skeleton Based Action Recognition and Tracking
UVC4UnityAndroid
UVC4UnityAndroid