xiaoyazhu

xiaoyazhu's starred repositories

OVDEval

A Comprehensive Evaluation Benchmark for Open-Vocabulary Detection (AAAI 2024)

Language: Python | License: Apache-2.0 | Stargazers: 31 | Issues: 0

OmDet

Fast and accurate open-vocabulary end-to-end object detection

Language: Python | License: Apache-2.0 | Stargazers: 27 | Issues: 0

Stargazers: 1586 | Issues: 0

Awesome-Open-Vocabulary-Object-Detection

A curated list of papers, datasets and resources pertaining to open vocabulary object detection.

Stargazers: 209 | Issues: 0

awesome-open-world-object-detection

This repository lists some awesome public Open World object detection series projects.

Stargazers: 17 | Issues: 0

Awesome-Multimodal-Large-Language-Models

✨✨ Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.

Stargazers: 8807 | Issues: 0

APE

[CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception

Language: Python | License: Apache-2.0 | Stargazers: 411 | Issues: 0

Awesome-Open-Vocabulary

(TPAMI 2024) A Survey on Open Vocabulary Learning

Stargazers: 635 | Issues: 0

GroundingDINO

Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Language: Python | License: Apache-2.0 | Stargazers: 4952 | Issues: 0

awesome-described-object-detection

A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring Expression Comprehension. Updated frequently and pull requests welcomed.

Stargazers: 108 | Issues: 0

CLIP

CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image

Language: Jupyter Notebook | License: MIT | Stargazers: 22036 | Issues: 0

GLEE

[CVPR 2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale

Language: Python | License: MIT | Stargazers: 873 | Issues: 0

YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Language: Jupyter Notebook | License: GPL-3.0 | Stargazers: 3311 | Issues: 0

openimages2coco

Convert Open Images annotations into MS COCO format to make them a drop-in replacement

Language: Jupyter Notebook | License: MIT | Stargazers: 105 | Issues: 0

RegionCLIP

[CVPR 2022] Official code for "RegionCLIP: Region-based Language-Image Pretraining"

Language: Python | License: Apache-2.0 | Stargazers: 643 | Issues: 0

open-images-dataset

Open Images is a dataset of ~9 million images that have been annotated with image-level labels and bounding boxes spanning thousands of classes.

Stargazers: 953 | Issues: 0

X-AnyLabeling

Effortless data labeling with AI support from Segment Anything and other awesome models.

Language: Python | License: GPL-3.0 | Stargazers: 2392 | Issues: 0

efficientvit

EfficientViT is a new family of vision models for efficient high-resolution vision.

Language: Python | License: Apache-2.0 | Stargazers: 1294 | Issues: 0

Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Language: Python | License: NOASSERTION | Stargazers: 3630 | Issues: 0

UniDetector

Code release for our CVPR 2023 paper "Detecting Everything in the Open World: Towards Universal Object Detection".

Language: Python | License: Apache-2.0 | Stargazers: 479 | Issues: 0

Grounded-Segment-Anything

Grounded-SAM: Marrying Grounding-DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 13412 | Issues: 0

segment-anything

The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 43911 | Issues: 0

GLIP

Grounded Language-Image Pre-training

Language: Python | License: MIT | Stargazers: 1947 | Issues: 0

VisualGLM-6B

Chinese and English multimodal conversational language model

Language: Python | License: Apache-2.0 | Stargazers: 3919 | Issues: 0

BrnoCompSpeed

Code for BrnoCompSpeed dataset

Language: Python | Stargazers: 1 | Issues: 0

CRAFT-Reimplementation

CRAFT-PyTorch: Character Region Awareness for Text Detection, a reimplementation in PyTorch

Language: Python | Stargazers: 461 | Issues: 0

caffe

Caffe: a fast open framework for deep learning.

Language: C++ | License: NOASSERTION | Stargazers: 33843 | Issues: 0

mmtracking

OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT), Video Instance Segmentation (VIS) with a unified framework.

Language: Python | License: Apache-2.0 | Stargazers: 3372 | Issues: 0

deep_sort_pytorch

Multiple object tracking (MOT) using DeepSORT and YOLOv3 with PyTorch

Language: Python | License: MIT | Stargazers: 2700 | Issues: 0

yolo_tracking

BoxMOT: pluggable SOTA tracking modules for segmentation, object detection and pose estimation models

Language: Python | License: AGPL-3.0 | Stargazers: 6075 | Issues: 0