zzz0326's starred repositories

CenterTrack

Simultaneous object detection and tracking using center points.

Language:PythonLicense:MITStargazers:2341Issues:0Issues:0

StrongSORT

[TMM 2023] StrongSORT: Make DeepSORT Great Again

Language:PythonLicense:GPL-3.0Stargazers:707Issues:0Issues:0

mmtracking

OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT), Video Instance Segmentation (VIS) with a unified framework.

Language:PythonLicense:Apache-2.0Stargazers:3435Issues:0Issues:0

SeqTrackv2

SeqTrackv2: Unified Sequence-to-Sequence Learning for Single- and Multi-Modal Visual Object Tracking

Language:PythonLicense:MITStargazers:36Issues:0Issues:0

SFSORT

SFSORT: Scene Features-based Simple Online Real-Time Tracker

Language:Jupyter NotebookLicense:MITStargazers:22Issues:0Issues:0

ISP-Teacher

[AAAI24] ISP-Teacher: Image Signal Process with Disentanglement Regularization for Unsupervised Domain Adaptive Dark Object Detection

Language:PythonStargazers:9Issues:0Issues:0

stable-diffusion

A latent text-to-image diffusion model

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:66457Issues:0Issues:0

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language:PythonLicense:Apache-2.0Stargazers:23810Issues:0Issues:0

latent-diffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Language:Jupyter NotebookLicense:MITStargazers:11003Issues:0Issues:0

AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Language:PythonLicense:MITStargazers:163491Issues:0Issues:0

consistency_models

Official repo for consistency models.

Language:PythonLicense:MITStargazers:6021Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:6078Issues:0Issues:0

Dreambooth-Stable-Diffusion

Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) by way of Textual Inversion (https://arxiv.org/abs/2208.01618) for Stable Diffusion (https://arxiv.org/abs/2112.10752). Tweaks focused on training faces, objects, and styles.

Language:Jupyter NotebookLicense:MITStargazers:3178Issues:0Issues:0

Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:14034Issues:0Issues:0

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:45291Issues:0Issues:0

Palette-Image-to-Image-Diffusion-Models

Unofficial implementation of Palette: Image-to-Image Diffusion Models by Pytorch

Language:PythonLicense:MITStargazers:1434Issues:0Issues:0

gpt_academic

为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。

Language:PythonLicense:GPL-3.0Stargazers:60902Issues:0Issues:0

360-mlc

[NeurIPS'22] 360-MLC: Multi-view Layout Consistency for Self-training and Hyper-parameter Tuning

Language:PythonStargazers:9Issues:0Issues:0

V-CNN

Viewport-based CNN for visual quality assessment on 360° video

Language:PythonLicense:MITStargazers:33Issues:0Issues:0

stdf-pytorch

Implementation of "Spatio-Temporal Deformable Convolution for Compressed Video Quality Enhancement" (AAAI'20).

Language:PythonLicense:Apache-2.0Stargazers:155Issues:0Issues:0

gmflow

[CVPR'22 Oral] GMFlow: Learning Optical Flow via Global Matching

Language:PythonLicense:Apache-2.0Stargazers:591Issues:0Issues:0

MiVOS

[CVPR 2021] Modular Interactive Video Object Segmentation: Interaction-to-Mask, Propagation and Difference-Aware Fusion. Semi-supervised VOS as well!

Language:PythonLicense:MITStargazers:458Issues:0Issues:0

STM

Video Object Segmentation using Space-Time Memory Networks

Language:PythonStargazers:405Issues:0Issues:0

CCNet

CCNet: Criss-Cross Attention for Semantic Segmentation (TPAMI 2020 & ICCV 2019).

Language:PythonLicense:MITStargazers:1407Issues:0Issues:0

mmselfsup

OpenMMLab Self-Supervised Learning Toolbox and Benchmark

Language:PythonLicense:Apache-2.0Stargazers:3130Issues:0Issues:0

swav

PyTorch implementation of SwAV https//arxiv.org/abs/2006.09882

Language:PythonLicense:NOASSERTIONStargazers:1960Issues:0Issues:0

saliency-360salient-2017

Scanpath Prediction on 360 degree Images using deep learning

Language:Jupyter NotebookLicense:MITStargazers:53Issues:0Issues:0

visualization

a collection of visualization function

Language:PythonLicense:MITStargazers:379Issues:0Issues:0
Language:PythonLicense:MITStargazers:508Issues:0Issues:0

self-supervised-relational-reasoning

Official PyTorch implementation of the paper "Self-Supervised Relational Reasoning for Representation Learning", NeurIPS 2020 Spotlight.

Language:PythonLicense:MITStargazers:141Issues:0Issues:0