hitachinsk

Kaidong Zhang's starred repositories

Perplexica

Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI

Language:TypeScriptMIT20669 143 444

albumentations

Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125

Language:PythonMIT14708 126 1235

LivePortrait

Bring portraits to life!

Language:PythonNOASSERTION14360 126 415

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookApache-2.011129 64 259

ShiArthur03

Language:MATLABGPL-3.010341 32 1357

dinov2

PyTorch code and models for the DINOv2 self-supervised learning method.

Language:Jupyter NotebookApache-2.010027 96 423

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language:C++Apache-2.09734 115 2291

video-subtitle-extractor

视频硬字幕提取，生成srt文件。无需申请第三方API，本地实现文本识别。基于深度学习的视频字幕提取框架，包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.

Language:PythonApache-2.06839 45 299

video-subtitle-remover

基于AI的图片/视频硬字幕去除、文本水印去除，无损分辨率生成去字幕、去水印后的图片/视频文件。无需申请第三方API，本地实现。AI-based tool for removing hard-coded subtitles and text-like watermarks from videos or Pictures.

Language:PythonApache-2.05670 47 120

DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

MIT4835 43 100

sd-webui-reactor

Fast and Simple Face Swap Extension for StableDiffusion WebUI (A1111 SD WebUI, SD WebUI Forge, SD.Next, Cagliostro)

Language:PythonAGPL-3.02619 30 352

2d-gaussian-splatting

[SIGGRAPH'24] 2D Gaussian Splatting for Geometrically Accurate Radiance Fields

Language:PythonNOASSERTION2430 30 188

pytorch-styleguide

An unofficial styleguide and best practices summary for PyTorch

Language:PythonGPL-3.01967 46 8

shape-of-motion

Language:PythonMIT937 16 66

PowerPaint

[ECCV 2024] PowerPaint, a versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting and shape-guided object inpainting with only a single model. 一个高质量多功能的图像修补模型，可以同时支持插入物体、移除物体、图像扩展、形状可控的物体生成，只需要一个模型

Language:PythonMIT825 17 133

SVD_Xtend

Stable Video Diffusion Training Code and Extensions.

Language:Python674 12 65

Apt

AI Productivity Tool - Free and open source, improve user productivity, protect privacy and data security. Provide efficient and convenient AI solutions, built-in local exclusive ChatGPT, Phi, DeepSeek, one-click batch intelligent processing of pictures, videos, audio, etc.

Language:C#MIT634 7 26

TransNetV2

TransNet V2: Shot Boundary Detection Neural Network

Language:PythonMIT583 9 47

DrivingDiffusion

Layout-Guided multi-view driving scene video generation with latent diffusion model

Language:PythonMIT514 21 12

SEA-RAFT

[ECCV2024 - Oral, Best Paper Award Candidate] SEA-RAFT: Simple, Efficient, Accurate RAFT for Optical Flow

Language:PythonBSD-3-Clause436 10 26

torch-splatting

A pure pytorch implementation of 3D gaussian Splatting

Language:Python361 13 17

reading-notes

张俊的读书笔记

Language:Jupyter NotebookMIT344 12 2

MobileFaceSwap

MobileFaceSwap: A Lightweight Framework for Video Face Swapping (AAAI 2022)

Language:PythonApache-2.0325 17 22

COCOCO

Video-Inpaint-Anything: This is the inference code for our paper CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility.

Language:Python275 5 21

awesome-faceSwap

papers about faceSwap

259 17 3

PatentDatabases

A summary of patent information database URLs from all over the world.

GPL-3.0228 40

awesome-diffusion-v2v

Awesome diffusion Video-to-Video (V2V). A collection of paper on diffusion model-based video editing, aka. video-to-video (V2V) translation. And a video editing benchmark code.

Language:PythonMIT207 5 3

DecoMotion

[ECCV 2024] Decomposition Betters Tracking Everything Everywhere

MIT113 18 1

NDR-Restore

Official Implementation of "Neural Degradation Representation Learning for All-In-One Image Restoration"

Language:PythonMIT28 1 7

awesome-face-swapping

A curated list of face swapping research papers

11 10