Wprofessor's starred repositories

clash-verge-rev

Continuation of Clash Verge - A Clash Meta GUI based on Tauri (Windows, MacOS, Linux)

Language:TypeScriptLicense:GPL-3.0Stargazers:31657Issues:104Issues:1502

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:19083Issues:157Issues:1466

apex

A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch

Language:PythonLicense:BSD-3-ClauseStargazers:8282Issues:101Issues:1174

Vim

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Language:PythonLicense:Apache-2.0Stargazers:2776Issues:30Issues:107

LISA

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Language:PythonLicense:Apache-2.0Stargazers:1737Issues:11Issues:136

Change-Detection-Review

A review of change detection methods, including codes and open data sets for deep learning. From paper: change detection based on artificial intelligence: state-of-the-art and challenges.

Neural-Network-Diffusion

We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters

LaVIN

[NeurIPS 2023] Official implementations of "Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models"

GroundingGPT

[ACL 2024] GroundingGPT: Language-Enhanced Multi-modal Grounding Model

Language:PythonLicense:Apache-2.0Stargazers:282Issues:14Issues:10

dora

Implementation of DoRA

Language:PythonLicense:MITStargazers:276Issues:10Issues:2

VCoder

VCoder: Versatile Vision Encoders for Multimodal Large Language Models, arXiv 2023 / CVPR 2024

Language:PythonLicense:Apache-2.0Stargazers:253Issues:9Issues:7

ScConv

🕹️SCConv: Spatial and Channel Reconstruction Convolution for Feature Redundancy

Remote-Sensing-ChatGPT

Chat with RS-ChatGPT and get the remote sensing interpretation results and the response!

RepAdapter

Official implementation of "Towards Efficient Visual Adaption via Structural Re-parameterization".

RS5M

RS5M: a large-scale vision language dataset for remote sensing

Language:PythonLicense:MITStargazers:190Issues:10Issues:21

ViP-LLaVA

[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts

Language:PythonLicense:Apache-2.0Stargazers:148Issues:7Issues:11

AiATrack

[ECCV'22] The official PyTorch implementation of our ECCV 2022 paper: "AiATrack: Attention in Attention for Transformer Visual Tracking".

Language:PythonLicense:MITStargazers:105Issues:4Issues:21

sscdnet

Semantic Scene Change Detection Network (CSCDNet + SSCDNet)

Language:PythonLicense:MITStargazers:103Issues:7Issues:15

SegMiF

ICCV2023 | Multi-interactive Feature Learning and a Full-time Multi-modality Benchmark for Image Fusion and Segmentation

ODTrack

The official implementation for the paper [ODTrack: Online Dense Temporal Token Learning for Visual Tracking].

Language:PythonLicense:MITStargazers:91Issues:6Issues:11

Change-Agent

Change-Agent: Towards Interactive Comprehensive Remote Sensing Change Interpretation and Analysis

Language:PythonLicense:MITStargazers:69Issues:5Issues:2

pa-sam

PA-SAM: Prompt Adapter SAM for High-quality Image Segmentation

TrackGPT

Tracking with Human-Intent Reasoning

Language:PythonLicense:Apache-2.0Stargazers:60Issues:3Issues:11

Changen

Scalable Multi-Temporal Remote Sensing Change Data Generation via Simulating Stochastic Change Process (ICCV 2023)

Language:PythonLicense:Apache-2.0Stargazers:47Issues:3Issues:3
Language:PythonLicense:MITStargazers:46Issues:1Issues:18

TGP-T

[AAAI2024] Official implementation of the AAAI 2024 paper TGP-T