刘国友's repositories

resemble-enhance

AI powered speech denoising and enhancement

License:MITStargazers:1Issues:0Issues:0
License:MITStargazers:1Issues:0Issues:0

adetailer

Auto detecting, masking and inpainting with detection model.

Language:PythonLicense:AGPL-3.0Stargazers:0Issues:1Issues:0

Af-DCD

The official project website of "Augmentation-free Dense Contrastive Distillation for Efficient Semantic Segmentation" (Af-DCD for short, accepted to NeurIPS 2023).

License:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

BiMatting

This project is the official implementation of our accepted NeurIPS 2023 paper BiMatting: Efficient Video Matting via Binarization.

Language:PythonStargazers:0Issues:1Issues:0

BVI-VFI-database

[IEEE TIP'2023] "BVI-VFI: A Video Quality Database for Video Frame Interpolation", Duolikun Danier, Fan Zhang, David Bull

License:NOASSERTIONStargazers:0Issues:1Issues:0

CA-SUM-360

A PyTorch implementation of our method from "An Integrated System for Spatio-Temporal Summarization of 360-degrees Videos", Proc. MMM 2024

Language:PythonStargazers:0Issues:0Issues:0

COMM

Pytorch code for paper From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models

License:MITStargazers:0Issues:0Issues:0

EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

EResFD

Lightweight Face Detector from CLOVA

License:Apache-2.0Stargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

FastLLVE

FastLLVE: Real-Time Low-Light Video Enhancement with Intensity-Aware Lookup Table (ACM MM 2023)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

frame-interpolation-pytorch

PyTorch implementation of FILM: Frame Interpolation for Large Motion, In ECCV 2022.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

gpt_academic

为ChatGPT/GLM提供实用化交互界面,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm2等本地模型。兼容文心一言, moss, llama2, rwkv, claude2, 通义千问, 书生, 讯飞星火等。

Language:PythonLicense:GPL-3.0Stargazers:0Issues:1Issues:0

HybridSORT

[AAAI2024]Hybrid-SORT: Weak Cues Matter for Online Multi-Object Tracking

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

Lightweight-Face-Detector-Pruning

Code and pruned models for our paper: K. Gkrispanis, N. Gkalelis, V. Mezaris, "Filter-Pruning of Lightweight Face Detectors Using a Geometric Median Criterion", Proc. IEEE/CVF Winter Conference on Applications of Computer Vision Workshops (WACVW 2024), Waikoloa, Hawaii, USA, Jan. 2024.

Stargazers:0Issues:0Issues:0

Matting-Anything

Matting Anything Model (MAM), an efficient and versatile framework for estimating the alpha matte of any instance in an image with flexible and interactive visual or linguistic user prompt guidance.

License:MITStargazers:0Issues:0Issues:0

Metric3D

The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image"

Language:PythonLicense:GPL-3.0Stargazers:0Issues:1Issues:0

MFT

MFT: Long-Term Tracking of Every Pixel -- code for the WACV 2024 paper

License:NOASSERTIONStargazers:0Issues:0Issues:0

MISO-VFI

Official implementation of "A Multi-In-Single-Out Network for Video Frame Interpolation without Optical Flow"

Language:PythonStargazers:0Issues:1Issues:0

MobileSAM-pytorch

Reproduction of MobileSAM using pytorch

Language:PythonStargazers:0Issues:1Issues:0

PixArt-alpha

Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Language:PythonLicense:AGPL-3.0Stargazers:0Issues:1Issues:0

sd-webui-fastblend

Make videos smooth!

License:Apache-2.0Stargazers:0Issues:0Issues:0

SlimSAM

SlimSAM: 0.1% Data Makes Segment Anything Slim

Language:PythonStargazers:0Issues:1Issues:0
Stargazers:0Issues:0Issues:0

syenet

SYENet: A Simple Yet Effective Network for Multiple Low-Level Vision Tasks with Real-Time Performance on Mobile Device, in ICCV 2023

License:Apache-2.0Stargazers:0Issues:0Issues:0

terminaltexteffects

Visual effects applied to text in the terminal.

License:MITStargazers:0Issues:0Issues:0

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

XMem2

A tool for efficient semi-supervised video object segmentation (great results with minimal manual labor) and a dataset for benchmarking

License:GPL-3.0Stargazers:0Issues:0Issues:0