Kaidong Zhang (hitachinsk)

User data from GitHub: https://github.com/hitachinsk

Company: Alibaba Group << USTC

Location: Kaiyuan county, Liaoning

Home Page: https://hitachinsk.github.io/

GitHub: @hitachinsk

Kaidong Zhang's starred repositories

Perplexica

Perplexica is an AI-powered search engine. It is an open-source alternative to Perplexity AI.

Language: TypeScript | License: MIT | Stargazers: 20669 | Issues: 143 | Issues: 444

albumentations

Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125

Language: Python | License: MIT | Stargazers: 14708 | Issues: 126 | Issues: 1235

LivePortrait

Bring portraits to life!

Language: Python | License: NOASSERTION | Stargazers: 14360 | Issues: 126 | Issues: 415

segment-anything-2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 11129 | Issues: 64 | Issues: 259

dinov2

PyTorch code and models for the DINOv2 self-supervised learning method.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 10027 | Issues: 96 | Issues: 423

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language: C++ | License: Apache-2.0 | Stargazers: 9734 | Issues: 115 | Issues: 2291

video-subtitle-extractor

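Extracts hard-coded subtitles from videos and generates srt files. No third-party API is required; text recognition runs locally. A deep-learning-based video subtitle extraction framework that includes subtitle region detection and subtitle content extraction. A GUI tool for extracting hard-coded subtitles (hardsub) from videos and generating srt files.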

Language: Python | License: Apache-2.0 | Stargazers: 6839 | Issues: 45 | Issues: 299

video-subtitle-remover

AI-based tool for removing hard-coded subtitles and text-like watermarks from images and videos, producing subtitle-free and watermark-free output files with no loss of resolution. No third-party API is required; everything runs locally.

Language: Python | License: Apache-2.0 | Stargazers: 5670 | Issues: 47 | Issues: 120

DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

sd-webui-reactor

Fast and Simple Face Swap Extension for StableDiffusion WebUI (A1111 SD WebUI, SD WebUI Forge, SD.Next, Cagliostro)

Language: Python | License: AGPL-3.0 | Stargazers: 2619 | Issues: 30 | Issues: 352

2d-gaussian-splatting

[SIGGRAPH'24] 2D Gaussian Splatting for Geometrically Accurate Radiance Fields

Language: Python | License: NOASSERTION | Stargazers: 2430 | Issues: 30 | Issues: 188

pytorch-styleguide

An unofficial styleguide and best practices summary for PyTorch

Language: Python | License: GPL-3.0 | Stargazers: 1967 | Issues: 46 | Issues: 8

PowerPaint

[ECCV 2024] PowerPaint, a high-quality, versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting, and shape-guided object inpainting with only a single model.

Language: Python | License: MIT | Stargazers: 825 | Issues: 17 | Issues: 133

SVD_Xtend

Stable Video Diffusion Training Code and Extensions.

Apt

AI Productivity Tool: free and open source; improves user productivity and protects privacy and data security. Provides efficient and convenient AI solutions, with built-in local exclusive ChatGPT, Phi, and DeepSeek, and one-click batch intelligent processing of pictures, videos, audio, etc.

Language: C# | License: MIT | Stargazers: 634 | Issues: 7 | Issues: 26

TransNetV2

TransNet V2: Shot Boundary Detection Neural Network

Language: Python | License: MIT | Stargazers: 583 | Issues: 9 | Issues: 47

DrivingDiffusion

Layout-Guided multi-view driving scene video generation with latent diffusion model

Language: Python | License: MIT | Stargazers: 514 | Issues: 21 | Issues: 12

SEA-RAFT

[ECCV2024 - Oral, Best Paper Award Candidate] SEA-RAFT: Simple, Efficient, Accurate RAFT for Optical Flow

Language: Python | License: BSD-3-Clause | Stargazers: 436 | Issues: 10 | Issues: 26

torch-splatting

A pure PyTorch implementation of 3D Gaussian Splatting

reading-notes

Zhang Jun's reading notes

Language: Jupyter Notebook | License: MIT | Stargazers: 344 | Issues: 12 | Issues: 2

MobileFaceSwap

MobileFaceSwap: A Lightweight Framework for Video Face Swapping (AAAI 2022)

Language: Python | License: Apache-2.0 | Stargazers: 325 | Issues: 17 | Issues: 22

COCOCO

Video-Inpaint-Anything: This is the inference code for our paper CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility.

PatentDatabases

A summary of patent information database URLs from all over the world.

License: GPL-3.0 | Stargazers: 228 | Issues: 4 | Issues: 0

awesome-diffusion-v2v

Awesome diffusion Video-to-Video (V2V). A collection of papers on diffusion model-based video editing, a.k.a. video-to-video (V2V) translation, along with video editing benchmark code.

Language: Python | License: MIT | Stargazers: 207 | Issues: 5 | Issues: 3

DecoMotion

[ECCV 2024] Decomposition Betters Tracking Everything Everywhere

NDR-Restore

Official Implementation of "Neural Degradation Representation Learning for All-In-One Image Restoration"

Language: Python | License: MIT | Stargazers: 28 | Issues: 1 | Issues: 7

awesome-face-swapping

A curated list of face swapping research papers