zhang-tao-whu

followers

following

stars

https://zhang-tao-whu.github.io/

zhangtao's repositories

e2ec

E2EC: An End-to-End Contour-based Method for High-Quality High-Speed Instance Segmentation

Language:PythonNOASSERTION214 9 46

DVIS

DVIS: Decoupled Video Instance Segmentation Framework

Language:PythonMIT124 4 33

DVIS_Plus

Language:PythonMIT88 3 21

PCM

Point Could Mamba: Point Cloud Learning via State Space Model

vis_clip

Language:PythonMIT2 1 1

DVIS-OV

Language:PythonNOASSERTION1 20

bubogpt

BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs

Language:PythonBSD-3-Clause000

CTVIS_VIT

Language:PythonMIT000

DCFormer

010

dfc-clip

Language:PythonApache-2.0010

DragGAN

Code for DragGAN (SIGGRAPH 2023)

000

huaman_seg

Language:PythonNOASSERTION010

fc-clip

This repo contains the code for our paper Convolutions Die Hard: Open-Vocabulary Panoptic Segmentation with Single Frozen Convolutional CLIP

Language:PythonApache-2.0000

HIPIE

Code release for "Hierarchical Open-vocabulary Universal Image Segmentation"

Language:Jupyter NotebookMIT000

InternLM

InternLM has open-sourced 7 and 20 billion parameter base models and chat models.

Language:PythonApache-2.0000

LLaVA

Visual Instruction Tuning: Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities.

Language:PythonApache-2.0000

Mask2Former

Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"

Language:PythonMIT000

mllmd_eval

Language:Python000

OMG-Seg

OMG-LLaVA and OMG-Seg codebase

Language:PythonNOASSERTION000

OmniScient-Model

This repo contains the code for our paper Towards Open-Ended Visual Recognition with Large Language Model

Language:Jupyter NotebookApache-2.0000

OpenSeeD

A Simple Framework for Open-Vocabulary Segmentation and Detection

Language:PythonApache-2.0000

Osprey

The code for "Osprey: Pixel Understanding with Visual Instruction Tuning"

Language:PythonApache-2.0000

paper_images

010

SAM-Adaptor-PyTorch

Language:PythonMIT000

Segment-Everything-Everywhere-All-At-Once

Language:PythonApache-2.0000

Semantic-SAM

Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"

000

subobjects

Official repository of paper "Subobject-level Image Tokenization"

000

tap_llava

Language:PythonApache-2.0010

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction"

Language:PythonMIT000

zhang-tao-whu.github.io

AcadHomepage: A Modern and Responsive Academic Personal Homepage

Language:JavaScriptMIT000