mustansarfiaz

Mustansar Fiaz's repositories

ScratchFormer

ScratchFormer: Remote Sensing Change Detection With Transformers Trained from Scratch

Language:Python35 5 2

SA2-Net

SA2-Net: Scale-aware Attention Network for Microscopic Image Segmentation (BMVC'23 -- Oral)

Language:Python16 30

Abstract. Person search is a challenging problem with various real- world applications, that aims at joint person detection and re-identification of a query person from uncropped gallery images. Although, previous study focuses on rich feature information learning, it’s still hard to re- trieve the query person due to the occurrence of appearance deformations and background distractors. In this paper, we propose a novel attention- aware relation mixer (ARM) module for person search, which exploits the global relation between different local regions within RoI of a per- son and make it robust against various appearance deformations and occlusion. The proposed ARM is composed of a relation mixer block and a spatio-channel attention layer. The relation mixer block introduces a spatially attended spatial mixing and a channel-wise attended channel mixing for effectively capturing discriminative relation features within an RoI. These discriminative relation features are further enriched by intro- ducing a spatio-channel attention where the foreground and background discriminability is empowered in a joint spatio-channel space. Our ARM module is generic and it does not rely on fine-grained supervisions or topological assumptions, hence being easily integrated into any Faster R-CNN based person search methods. Comprehensive experiments are performed on two challenging benchmark datasets: CUHK-SYSU [1] and PRW [2]. Our PS-ARM achieves state-of-the-art performance on both datasets. On the challenging PRW dataset, our PS-ARM achieves an absolute gain of 5% in the mAP score over SeqNet, while operating at a comparable speed

Language:PythonMIT13 4 1

DDAM-PS

DDAM-PS: Diligent Domain Adaptive Mixer for Person Search -- WACV2024

Language:Python10 1 2

SAT

SAT: Scale-Augmented Transformer for Person Search

Language:Python4 3 4

SCS-Siam

SCS-Siam: Learning Soft Mask Based Feature Fusion with Channel and Spatial Attention for Robust Visual Object Tracking

Language:Python3 2 1

IRCA-Siam

IRCA-Siam: Improving Object Tracking by Added Noise and Channel Attention

Language:Python1 10

Large-Selective-Kernel-Network

Language:PythonApache-2.0100

SiamTrackers

(2020)The PyTorch version of Siamese ，SiamFC，SiamRPN，DaSiamRPN，UpdateNet，SiamDW，SiamRPN++, SiamMask，and SiamFC++ ; Visual object tracking based on deep learning

Language:Python1 10

AFS-Siam

AFS-Siam: Adaptive Feature Selection Siamese Networks for Visual Tracking

Language:Python020

Awesome-Transformer-Attention

An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites

000

benchmark_results

Visual Tracking Paper List

000

COAT

Official Code for CVPR 2022 paper Cascade Transformers for End-to-End Person Search

Language:Python010

Directional-Deep-Embedding-and-Appearance-Learning-for-Fast-Video-Object-Segmentation

We propose a directional deep embedding and appearance learning (DDEAL) method, which is free of the online fine-tuning process, for fast VOS. DDEAL achieves a J & F mean score of 74.8% on DAVIS 2017 dataset and an overall score G of 71.3% on the large-scale YouTube-VOS dataset, while retaining a speed of 25 fps with a single NVIDIA TITAN Xp GPU. Furthermore, our faster version runs 31 fps with only a little accuracy loss.

Language:Python000