xiaoyazhu's starred repositories
Awesome-Open-Vocabulary-Object-Detection
A curated list of papers, datasets and resources pertaining to open vocabulary object detection.
awesome-open-world-object-detection
This repository lists awesome public open-world object detection projects and resources.
Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles: Latest papers and datasets on Multimodal Large Language Models, and their evaluation.
Awesome-Open-Vocabulary
(TPAMI 2024) A Survey on Open Vocabulary Learning
GroundingDINO
Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
awesome-described-object-detection
A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring Expression Comprehension. Updated frequently and pull requests welcomed.
YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
openimages2coco
Convert Open Images annotations into MS COCO format to make them a drop-in replacement.
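To illustrate the kind of conversion this repository automates: Open Images stores boxes as normalized corner coordinates (XMin, XMax, YMin, YMax), while COCO expects absolute `[x, y, width, height]` boxes inside an `images`/`annotations`/`categories` JSON structure. The sketch below is a hypothetical, minimal version of that mapping (function names and the simplified record layout are illustrative, not the repository's actual API):

```python
def openimages_box_to_coco(box, img_w, img_h):
    """Convert one normalized Open Images box to a COCO-style [x, y, w, h] bbox."""
    x = box["XMin"] * img_w
    y = box["YMin"] * img_h
    w = (box["XMax"] - box["XMin"]) * img_w
    h = (box["YMax"] - box["YMin"]) * img_h
    return [round(x, 2), round(y, 2), round(w, 2), round(h, 2)]

def to_coco(images, boxes, categories):
    """Assemble a COCO-style annotation dict from simplified Open Images records.

    images:     {image_id: (width_px, height_px)}
    boxes:      list of dicts with ImageID / LabelName / XMin / XMax / YMin / YMax
    categories: {label_name: category_id}
    """
    coco = {
        "images": [{"id": i, "width": wh[0], "height": wh[1]}
                   for i, wh in images.items()],
        "categories": [{"id": cid, "name": name}
                       for name, cid in categories.items()],
        "annotations": [],
    }
    for ann_id, b in enumerate(boxes, start=1):
        img_w, img_h = images[b["ImageID"]]
        bbox = openimages_box_to_coco(b, img_w, img_h)
        coco["annotations"].append({
            "id": ann_id,
            "image_id": b["ImageID"],
            "category_id": categories[b["LabelName"]],
            "bbox": bbox,
            "area": bbox[2] * bbox[3],  # COCO uses pixel area for box annotations
            "iscrowd": 0,
        })
    return coco
```

The real converter additionally handles label-name mapping, segmentation masks, and licensing metadata; this only shows the coordinate and schema translation.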
RegionCLIP
[CVPR 2022] Official code for "RegionCLIP: Region-based Language-Image Pretraining"
open-images-dataset
Open Images is a dataset of ~9 million images that have been annotated with image-level labels and bounding boxes spanning thousands of classes.
X-AnyLabeling
Effortless data labeling with AI support from Segment Anything and other awesome models.
efficientvit
EfficientViT is a family of vision models for efficient high-resolution tasks.
UniDetector
Code release for our CVPR 2023 paper "Detecting Everything in the Open World: Towards Universal Object Detection".
Grounded-Segment-Anything
Grounded-SAM: Marrying Grounding-DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
VisualGLM-6B
A Chinese-English bilingual multimodal conversational language model.
BrnoCompSpeed
Code for BrnoCompSpeed dataset
CRAFT-Reimplementation
CRAFT-PyTorch: a PyTorch reimplementation of "Character Region Awareness for Text Detection".
mmtracking
OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT), Video Instance Segmentation (VIS) with a unified framework.
deep_sort_pytorch
Multiple object tracking (MOT) using DeepSORT and YOLOv3 with PyTorch.
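At the core of DeepSORT-style tracking-by-detection is an association step that matches existing track boxes to new detections. The sketch below shows a simplified, IoU-only greedy variant of that step (DeepSORT proper solves the assignment with the Hungarian algorithm over a combined motion and appearance cost, and maintains tracks with a Kalman filter); all names here are illustrative, not the repository's API:

```python
def iou(a, b):
    """Intersection-over-union of two (x1, y1, x2, y2) boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0

def greedy_match(tracks, detections, iou_threshold=0.3):
    """Greedily pair track boxes with detection boxes by descending IoU.

    Returns (matches, unmatched_track_idxs, unmatched_det_idxs); unmatched
    detections typically spawn new tracks, unmatched tracks age out.
    """
    pairs = sorted(
        ((iou(t, d), ti, di)
         for ti, t in enumerate(tracks)
         for di, d in enumerate(detections)),
        reverse=True,
    )
    matches, used_t, used_d = [], set(), set()
    for score, ti, di in pairs:
        if score < iou_threshold:
            break  # remaining pairs overlap too little to be the same object
        if ti in used_t or di in used_d:
            continue
        matches.append((ti, di))
        used_t.add(ti)
        used_d.add(di)
    unmatched_t = [i for i in range(len(tracks)) if i not in used_t]
    unmatched_d = [i for i in range(len(detections)) if i not in used_d]
    return matches, unmatched_t, unmatched_d
```

The "deep" in DeepSORT refers to replacing part of this IoU cost with distances between learned appearance embeddings, which keeps identities stable through occlusions; the greedy loop above is only the skeleton of the idea.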
yolo_tracking
BoxMOT: pluggable SOTA tracking modules for segmentation, object detection and pose estimation models