Kaidong Zhang's starred repositories
Perplexica
Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI
albumentations
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
LivePortrait
Bring portraits to life!
segment-anything-2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
video-subtitle-extractor
视频硬字幕提取,生成srt文件。无需申请第三方API,本地实现文本识别。基于深度学习的视频字幕提取框架,包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.
video-subtitle-remover
基于AI的图片/视频硬字幕去除、文本水印去除,无损分辨率生成去字幕、去水印后的图片/视频文件。无需申请第三方API,本地实现。AI-based tool for removing hard-coded subtitles and text-like watermarks from videos or Pictures.
DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
sd-webui-reactor
Fast and Simple Face Swap Extension for StableDiffusion WebUI (A1111 SD WebUI, SD WebUI Forge, SD.Next, Cagliostro)
2d-gaussian-splatting
[SIGGRAPH'24] 2D Gaussian Splatting for Geometrically Accurate Radiance Fields
pytorch-styleguide
An unofficial styleguide and best practices summary for PyTorch
PowerPaint
[ECCV 2024] PowerPaint, a versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting and shape-guided object inpainting with only a single model. 一个高质量多功能的图像修补模型,可以同时支持插入物体、移除物体、图像扩展、形状可控的物体生成,只需要一个模型
TransNetV2
TransNet V2: Shot Boundary Detection Neural Network
DrivingDiffusion
Layout-Guided multi-view driving scene video generation with latent diffusion model
torch-splatting
A pure pytorch implementation of 3D gaussian Splatting
reading-notes
张俊的读书笔记
MobileFaceSwap
MobileFaceSwap: A Lightweight Framework for Video Face Swapping (AAAI 2022)
awesome-faceSwap
papers about faceSwap
PatentDatabases
A summary of patent information database URLs from all over the world.
awesome-diffusion-v2v
Awesome diffusion Video-to-Video (V2V). A collection of paper on diffusion model-based video editing, aka. video-to-video (V2V) translation. And a video editing benchmark code.
DecoMotion
[ECCV 2024] Decomposition Betters Tracking Everything Everywhere
NDR-Restore
Official Implementation of "Neural Degradation Representation Learning for All-In-One Image Restoration"
awesome-face-swapping
A curated list of face swapping research papers