刘国友's repositories

resemble-enhance

AI powered speech denoising and enhancement

Language:PythonLicense:MITStargazers:1Issues:1Issues:0
Language:PythonLicense:MITStargazers:1Issues:1Issues:0

adetailer

Auto detecting, masking and inpainting with detection model.

Language:PythonLicense:AGPL-3.0Stargazers:0Issues:1Issues:0

Af-DCD

The official project website of "Augmentation-free Dense Contrastive Distillation for Efficient Semantic Segmentation" (Af-DCD for short, accepted to NeurIPS 2023).

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:1Issues:0

CA-SUM-360

A PyTorch implementation of our method from "An Integrated System for Spatio-Temporal Summarization of 360-degrees Videos", Proc. MMM 2024

Language:PythonStargazers:0Issues:1Issues:0

COMM

Pytorch code for paper From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models

License:MITStargazers:0Issues:1Issues:0

ELD

Physics-based Noise Modeling for Extreme Low-light Photography (CVPR 2020 Oral & TPAMI 2021)

License:MITStargazers:0Issues:0Issues:0

EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

EResFD

Lightweight Face Detector from CLOVA

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

frame-interpolation-pytorch

PyTorch implementation of FILM: Frame Interpolation for Large Motion, In ECCV 2022.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

FuseSR

[SIGGRAPH Asia 2023]FuseSR: Super Resolution for Real-time Rendering through Efficient Multi-resolution Fusion

License:MITStargazers:0Issues:0Issues:0

gpt_academic

为ChatGPT/GLM提供实用化交互界面,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm2等本地模型。兼容文心一言, moss, llama2, rwkv, claude2, 通义千问, 书生, 讯飞星火等。

Language:PythonLicense:GPL-3.0Stargazers:0Issues:1Issues:0

HybridSORT

[AAAI2024]Hybrid-SORT: Weak Cues Matter for Online Multi-Object Tracking

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

Inpaint-Anything

Inpaint anything using Segment Anything and inpainting models.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

ISP-Guide

Image Signal Processing (ISP) Guide. Learn all about the process of converting an image/video into digital form by performing tasks like noise reduction, filtering, auto exposure, autofocus, HDR correction, and image sharpening with a Specialized type of media processor.

Language:PythonStargazers:0Issues:0Issues:0

Matting-Anything

Matting Anything Model (MAM), an efficient and versatile framework for estimating the alpha matte of any instance in an image with flexible and interactive visual or linguistic user prompt guidance.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

Metric3D

The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image"

Language:PythonLicense:GPL-3.0Stargazers:0Issues:1Issues:0

MFT

MFT: Long-Term Tracking of Every Pixel -- code for the WACV 2024 paper

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

MISO-VFI

Official implementation of "A Multi-In-Single-Out Network for Video Frame Interpolation without Optical Flow"

Language:PythonStargazers:0Issues:1Issues:0

PixArt-alpha

Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Language:PythonLicense:AGPL-3.0Stargazers:0Issues:1Issues:0

sd-webui-fastblend

Make videos smooth!

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

SkinToneClassifier

An easy-to-use library for skin tone classification

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

SlimSAM

SlimSAM: 0.1% Data Makes Segment Anything Slim

Language:PythonStargazers:0Issues:1Issues:0

syenet

SYENet: A Simple Yet Effective Network for Multiple Low-Level Vision Tasks with Real-Time Performance on Mobile Device, in ICCV 2023

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

terminaltexteffects

Visual effects applied to text in the terminal.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

XMem2

A tool for efficient semi-supervised video object segmentation (great results with minimal manual labor) and a dataset for benchmarking

Language:PythonLicense:GPL-3.0Stargazers:0Issues:1Issues:0