xinyu1205

Xinyu Huang's starred repositories

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookApache-2.046146 304 658

detr

End-to-End Object Detection with Transformers

Language:PythonApache-2.013231 149 526

Transformers-Tutorials

This repository contains demos I made with the Transformers library by HuggingFace.

Language:Jupyter NotebookMIT8779 133 438

dinov2

PyTorch code and models for the DINOv2 self-supervised learning method.

Language:Jupyter NotebookApache-2.08575 96 380

fiftyone

The open-source tool for building high-quality datasets and computer vision models

Language:PythonApache-2.07985 56 1488

cocoapi

COCO API - Dataset @ http://cocodataset.org/

Language:Jupyter NotebookNOASSERTION6039 112 555

tpu

Reference models and tools for Cloud TPUs.

Language:Jupyter NotebookApache-2.05207 355 473

Object-Detection-Metrics

Most popular metrics used to evaluate object detection algorithms.

Language:PythonMIT4914 70 149

AppAgent

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

Language:PythonMIT4769 60 79

scenic

Scenic: A Jax Library for Computer Vision Research and Beyond

Language:PythonApache-2.03191 39 248

Mask2Former

Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"

Language:PythonMIT2402 29 230

DINO

[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"

Language:PythonApache-2.02120 31 257

detrex

detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.

Language:PythonApache-2.01942 25 161

LVM

Language:PythonApache-2.01722 123 20

X-Decoder

[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language

Language:PythonApache-2.01276 34 68

ovsam

[ECCV 2024] The official code of paper "Open-Vocabulary SAM".

Language:PythonNOASSERTION870 14 35

DAB-DETR

[ICLR 2022] Official implementation of the paper "DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR"

Language:Jupyter NotebookApache-2.0499 17 71

DCNv4

[CVPR 2024] Deformable Convolution v4

Language:PythonMIT446 7 69

MaskCLIP

Official PyTorch implementation of "Extract Free Dense Labels from CLIP" (ECCV 22 Oral)

Language:PythonApache-2.0380 7 18

DreamLLM

[ICLR 2024 Spotlight] DreamLLM: Synergistic Multimodal Comprehension and Creation

Language:PythonApache-2.0361 17 22

LLaVA-Grounding

Language:PythonApache-2.0316 20 25

LOST

Pytorch implementation of LOST unsupervised object discovery method

Language:PythonNOASSERTION234 8 16

nxtp

Object Recognition as Next Token Prediction (CVPR 2024)

Language:PythonNOASSERTION146 2 5

SCLIP

Official implementation of SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference

Language:Python103 4 15

DAC-DETR

[NIPS2023] This is an official implementation of paper "DAC-DETR: Divide the Attention Layers and Conquer".

Language:PythonMIT51 1 5

GroupDETR

[ICCV 2023] Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment

39 8 1

FineR

[ICLR'24] Democratizing Fine-grained Visual Recognition with Large Language Models

Language:PythonApache-2.03200

TagAlign

Official implementation of TagAlign

Language:Python31 4 2

Deformable-DETR

Deformable DETR: Deformable Transformers for End-to-End Object Detection.

Language:PythonApache-2.017 10

H-Detic-LVIS

Language:PythonApache-2.07 1 1