wangwenfeng0

王文锋's starred repositories

llama_index

LlamaIndex is a data framework for your LLM applications

Language:PythonMIT36534 246 5433

whisper.cpp

Port of OpenAI's Whisper model in C/C++

Language:C++MIT35477 312 1362

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonMIT35201 211 1298

facefusion

Industry leading face manipulation platform

Language:PythonNOASSERTION19367 1860

rembg

Rembg is a tool to remove images background

Language:PythonMIT16824 148 505

Wav2Lip

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs

Language:Python10716 172 663

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language:C++Apache-2.08579 93 1930

stable-diffusion-webui-forge

Language:PythonAGPL-3.08329 85 1403

CenterNet

Object detection, 3D detection, and pose estimation using center point detection:

Language:PythonMIT7279 113 1002

Auto-Photoshop-StableDiffusion-Plugin

A user-friendly plug-in that makes it easy to generate stable diffusion images inside Photoshop using either Automatic or ComfyUI as a backend.

Language:TypeScriptMIT6772 75 370

video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Language:PythonApache-2.06612 74 245

OOTDiffusion

Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on

Language:PythonNOASSERTION5557 73 205

Realtime_Multi-Person_Pose_Estimation

Code repo for realtime multi-person pose estimation in CVPR'17 (Oral)

Language:Jupyter NotebookNOASSERTION5097 258 236

transformer-debugger

Language:PythonMIT4030 25 14

sd-forge-layerdiffuse

[WIP] Layer Diffusion for WebUI (via Forge)

Language:PythonApache-2.03685 37 90

ChatRTX

A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM

Language:TypeScriptNOASSERTION2740 61 81

human

Human: AI-powered 3D Face Detection & Rotation Tracking, Face Description & Recognition, Body Pose Tracking, 3D Hand & Finger Tracking, Iris Analysis, Age & Gender & Emotion Prediction, Gaze Tracking, Gesture Recognition

Language:HTMLMIT2357 44 282