刘国友's repositories
adetailer
Auto detecting, masking and inpainting with detection model.
BiMatting
This project is the official implementation of our accepted NeurIPS 2023 paper BiMatting: Efficient Video Matting via Binarization.
BVI-VFI-database
[IEEE TIP'2023] "BVI-VFI: A Video Quality Database for Video Frame Interpolation", Duolikun Danier, Fan Zhang, David Bull
COMM
Pytorch code for paper From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models
Diff-Protect
🛡️ Code repo for paper: Toward effective protection against diffusion-based mimicry through score distillation
DQ-Det
Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation
EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
EResFD
Lightweight Face Detector from CLOVA
FastLLVE
FastLLVE: Real-Time Low-Light Video Enhancement with Intensity-Aware Lookup Table (ACM MM 2023)
frame-interpolation-pytorch
PyTorch implementation of FILM: Frame Interpolation for Large Motion, In ECCV 2022.
gpt_academic
为ChatGPT/GLM提供实用化交互界面,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm2等本地模型。兼容文心一言, moss, llama2, rwkv, claude2, 通义千问, 书生, 讯飞星火等。
instructor-embedding
[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings
LiteTrack
A fast and high-performance visual object tracker with real-time speed on Jetson.
MARLIN
[CVPR] MARLIN: Masked Autoencoder for facial video Representation LearnINg
Metric3D
The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image"
MFT
MFT: Long-Term Tracking of Every Pixel -- code for the WACV 2024 paper
MobileSAM-pytorch
Reproduction of MobileSAM using pytorch
MVT
[BMVC 2023] Mobile Vision Transformer-based Visual Object Tracking
norfair
Lightweight Python library for adding real-time multi-object tracking to any detector.
PixArt-alpha
Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Portrait-Mode-Video
Video dataset dedicated to portrait-mode video recognition.
ProPainter
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
ROMTrack
[ICCV 2023] Robust Object Modeling for Visual Tracking, Official Implementation
sd-webui-fastblend
Make videos smooth!
syenet
SYENet: A Simple Yet Effective Network for Multiple Low-Level Vision Tasks with Real-Time Performance on Mobile Device, in ICCV 2023
terminaltexteffects
Visual effects applied to text in the terminal.
TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
XMem2
A tool for efficient semi-supervised video object segmentation (great results with minimal manual labor) and a dataset for benchmarking