Mennatullah Siam's repositories
TFSegmentation
RTSeg: Real-time Semantic Segmentation Comparative Study
AdaptiveMaskedProxies
Adaptive Masked Proxies for Few Shot Semantic Segmentation
video_class_agnostic_segmentation
Official Datasets and Implementation from our Paper "Video Class Agnostic Segmentation in Autonomous Driving".
MMC-MultiscaleMemory
Official Implementation of Multiscale Memory Comparator
Awesome-Visual-Grounding
[TPAMI 2025] Towards Visual Grounding: A Survey
groundLMM
Emerging Pixel Grounding in Large Multimodal Models Without Grounding Supervision
LISA
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
Mask2Former
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
MeViS
[ICCV 2023] MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions
OMG-Seg
OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]
PixFoundationSeries
PixFoundation Series Project Webpage
Sa2VA
🔥 Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
VideoGLaMM
[CVPR 2025 🔥] A Large Multimodal Model for Pixel-Level Visual Grounding in Videos
VisTR-OVIS
[CVPR2021 Oral] End-to-End Video Instance Segmentation with Transformers