Xinyu Huang (xinyu1205)

xinyu1205

Geek Repo

Company:Fudan University

Location:Shanghai, China

Home Page:https://xinyu1205.github.io

Github PK Tool:Github PK Tool

Xinyu Huang's starred repositories

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:46146Issues:304Issues:658

detr

End-to-End Object Detection with Transformers

Language:PythonLicense:Apache-2.0Stargazers:13231Issues:149Issues:526

Transformers-Tutorials

This repository contains demos I made with the Transformers library by HuggingFace.

Language:Jupyter NotebookLicense:MITStargazers:8779Issues:133Issues:438

dinov2

PyTorch code and models for the DINOv2 self-supervised learning method.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:8575Issues:96Issues:380

fiftyone

The open-source tool for building high-quality datasets and computer vision models

Language:PythonLicense:Apache-2.0Stargazers:7985Issues:56Issues:1488

cocoapi

COCO API - Dataset @ http://cocodataset.org/

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:6039Issues:112Issues:555

tpu

Reference models and tools for Cloud TPUs.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:5207Issues:355Issues:473

Object-Detection-Metrics

Most popular metrics used to evaluate object detection algorithms.

Language:PythonLicense:MITStargazers:4914Issues:70Issues:149

AppAgent

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

Language:PythonLicense:MITStargazers:4769Issues:60Issues:79

scenic

Scenic: A Jax Library for Computer Vision Research and Beyond

Language:PythonLicense:Apache-2.0Stargazers:3191Issues:39Issues:248

Mask2Former

Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"

Language:PythonLicense:MITStargazers:2402Issues:29Issues:230

DINO

[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"

Language:PythonLicense:Apache-2.0Stargazers:2120Issues:31Issues:257

detrex

detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.

Language:PythonLicense:Apache-2.0Stargazers:1942Issues:25Issues:161
Language:PythonLicense:Apache-2.0Stargazers:1722Issues:123Issues:20

X-Decoder

[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language

Language:PythonLicense:Apache-2.0Stargazers:1276Issues:34Issues:68

ovsam

[ECCV 2024] The official code of paper "Open-Vocabulary SAM".

Language:PythonLicense:NOASSERTIONStargazers:870Issues:14Issues:35

DAB-DETR

[ICLR 2022] Official implementation of the paper "DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR"

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:499Issues:17Issues:71

DCNv4

[CVPR 2024] Deformable Convolution v4

Language:PythonLicense:MITStargazers:446Issues:7Issues:69

MaskCLIP

Official PyTorch implementation of "Extract Free Dense Labels from CLIP" (ECCV 22 Oral)

Language:PythonLicense:Apache-2.0Stargazers:380Issues:7Issues:18

DreamLLM

[ICLR 2024 Spotlight] DreamLLM: Synergistic Multimodal Comprehension and Creation

Language:PythonLicense:Apache-2.0Stargazers:361Issues:17Issues:22
Language:PythonLicense:Apache-2.0Stargazers:316Issues:20Issues:25

LOST

Pytorch implementation of LOST unsupervised object discovery method

Language:PythonLicense:NOASSERTIONStargazers:234Issues:8Issues:16

nxtp

Object Recognition as Next Token Prediction (CVPR 2024)

Language:PythonLicense:NOASSERTIONStargazers:146Issues:2Issues:5

SCLIP

Official implementation of SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference

DAC-DETR

[NIPS2023] This is an official implementation of paper "DAC-DETR: Divide the Attention Layers and Conquer".

Language:PythonLicense:MITStargazers:51Issues:1Issues:5

GroupDETR

[ICCV 2023] Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment

FineR

[ICLR'24] Democratizing Fine-grained Visual Recognition with Large Language Models

Language:PythonLicense:Apache-2.0Stargazers:32Issues:0Issues:0

TagAlign

Official implementation of TagAlign

Deformable-DETR

Deformable DETR: Deformable Transformers for End-to-End Object Detection.

Language:PythonLicense:Apache-2.0Stargazers:17Issues:1Issues:0
Language:PythonLicense:Apache-2.0Stargazers:7Issues:1Issues:1