Yifan Bai (AlexDotHam)

Company: Xi'an Jiaotong University

Yifan Bai's starred repositories

Open-AnimateAnyone

Unofficial Implementation of Animate Anyone

PETR

[ECCV2022] PETR: Position Embedding Transformation for Multi-View 3D Object Detection & [ICCV2023] PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images

Language: Python | License: NOASSERTION | Stargazers: 852 | Issues: 14 | Issues: 161

LLM-in-Vision

Recent LLM-based CV and related works. Welcome to comment/contribute!

StreamPETR

[ICCV 2023] StreamPETR: Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection

Language: Python | License: NOASSERTION | Stargazers: 556 | Issues: 13 | Issues: 223

prismatic-vlms

A flexible and efficient codebase for training visually-conditioned language models (VLMs)

Language: Python | License: MIT | Stargazers: 404 | Issues: 12 | Issues: 36

OSTrack

[ECCV 2022] Joint Feature Learning and Relation Modeling for Tracking: A One-Stream Framework

Language: Python | License: MIT | Stargazers: 363 | Issues: 4 | Issues: 118

Transformer_Tracking

This repository is a paper digest of Transformer-related approaches in visual tracking tasks.

Rewrite-the-Stars

[CVPR 2024] Rewrite the Stars

Language: Python | License: Apache-2.0 | Stargazers: 242 | Issues: 2 | Issues: 17

mindcv

A toolbox of vision models and algorithms based on MindSpore

Language: Python | License: Apache-2.0 | Stargazers: 230 | Issues: 12 | Issues: 170
Language: Python | License: Apache-2.0 | Stargazers: 221 | Issues: 13 | Issues: 77

SED

[CVPR2024] Official PyTorch Implementation of SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation.

Language: Python | License: Apache-2.0 | Stargazers: 109 | Issues: 1 | Issues: 21

SOLO

Public code repo for paper "A Single Transformer for Scalable Vision-Language Modeling"

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 102 | Issues: 2 | Issues: 1

3D-LR

Can 3D Vision-Language Models Truly Understand Natural Language?

Stargazers: 20 | Issues: 0 | Issues: 0

SPEED

PyTorch implementation of paper "Sparse Parameterization for Epitomic Dataset Distillation" in NeurIPS 2023.

IT3DEgo

CVPR 2024 "Instance Tracking in 3D Scenes from Egocentric Videos"

Language: Jupyter Notebook | License: MIT | Stargazers: 12 | Issues: 1 | Issues: 1

EvoPrompt

PyTorch implementation of paper "Evolving Parameterized Prompt Memory for Continual Learning" in AAAI 2024 (Oral).

Language: Python | License: MIT | Stargazers: 6 | Issues: 1 | Issues: 2

ARTrack

Autoregressive Visual Tracking, CVPR 2023 (Highlight, Top 2.5%)

Language: Python | License: Apache-2.0 | Stargazers: 4 | Issues: 1 | Issues: 0

DIBD

PyTorch implementation of paper "Blind Hyperspectral Image Denoising with Degradation Information Learning" in Remote Sensing 2023.

Language: Python | Stargazers: 3 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 2 | Issues: 1 | Issues: 0
Language: JavaScript | Stargazers: 1 | Issues: 1 | Issues: 0
Language: JavaScript | Stargazers: 1 | Issues: 2 | Issues: 0