Muzammal Naseer (Muzammal-Naseer)

Muzammal-Naseer

Geek Repo

Location:Abu Dhabi, UAE

Home Page:muzammal-naseer.com

Twitter:@NaseerMuzammal

Github PK Tool:Github PK Tool

Muzammal Naseer's starred repositories

grok-1

Grok open release

Language:PythonLicense:Apache-2.0Stargazers:48570Issues:542Issues:194
Language:PythonLicense:NOASSERTIONStargazers:7930Issues:149Issues:0

Video-ChatGPT

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.

Language:PythonLicense:CC-BY-4.0Stargazers:981Issues:14Issues:99

MobiLlama

MobiLlama : Small Language Model tailored for edge devices

Language:PythonLicense:Apache-2.0Stargazers:537Issues:12Issues:11

GeoChat

[CVPR 2024 🔥] GeoChat, the first grounded Large Vision Language Model for Remote Sensing

GiT

Official Implementation of "GiT: Towards Generalist Vision Transformer through Universal Language Interface"

Language:PythonLicense:Apache-2.0Stargazers:218Issues:6Issues:7

PromptSRC

[ICCV'23 Main Track, WECIA'23 Oral] Official repository of paper titled "Self-regulating Prompts: Foundational Model Adaptation without Forgetting".

Language:PythonLicense:MITStargazers:190Issues:5Issues:13

Vita-CLIP

Official repository for "Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting" [CVPR 2023]

Language:PythonLicense:MITStargazers:98Issues:7Issues:10

Clip2Protect

[CVPR 2023] Official repository of paper titled "CLIP2Protect: Protecting Facial Privacy using Text-Guided Makeup via Adversarial Latent Search".

Video-FocalNets

Official repository for "Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition" [ICCV 2023]

PromptAlign

[NeurIPS 2023] Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization

ProText

Official repository of paper titled "Learning to Prompt with Text Only Supervision for Vision-Language Models".

Language:PythonLicense:MITStargazers:73Issues:3Issues:2

satmae_pp

Official repository for "Rethinking Transformers Pre-training for Multi-Spectral Satellite Imagery" (CVPR 2024)

Language:PythonLicense:Apache-2.0Stargazers:62Issues:0Issues:0

llmblueprint

[ICLR 2024] Official code for the paper "LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts"

Language:Jupyter NotebookStargazers:53Issues:2Issues:3

FLIP

Official implementation of the paper "FLIP: Cross-domain Face Anti-spoofing with Language Guidance". (ICCV 2023)

vafa

[MICCAI 2023] Official code repository of paper titled "Frequency Domain Adversarial Training for Robust Volumetric Medical Segmentation" accepted in MICCAI 2023 conference.

Language:PythonLicense:MITStargazers:45Issues:2Issues:1

cooperative-foundational-models

Official code for our paper "Enhancing Novel Object Detection via Cooperative Foundational Models"

Language:PythonLicense:MITStargazers:42Issues:6Issues:6

SegNext

Rethinking Interactive Image Segmentation with Low Latency, High Quality, and Diverse Prompts (CVPR 2024)

Language:PythonLicense:MITStargazers:39Issues:0Issues:0

PromptCAL

Official Implementation of paper: PromptCAL: Contrastive Affinity Learning via Auxiliary Prompts for Generalized Novel Category Discovery (CVPR'23)

Language:PythonLicense:MITStargazers:36Issues:4Issues:9

clippy

Perceptual Grouping in Contrastive Vision-Language Models (ICCV'23)

Language:Jupyter NotebookLicense:GPL-3.0Stargazers:33Issues:3Issues:1

CVRR-Evaluation-Suite

Official repository of paper titled "How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs".

Language:PythonLicense:CC-BY-4.0Stargazers:31Issues:0Issues:0

ObjectCompose

About Official repository of paper titled "ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes"

Language:Jupyter NotebookStargazers:29Issues:0Issues:0

composed-video-retrieval

Composed Video Retrieval

Language:PythonLicense:Apache-2.0Stargazers:27Issues:2Issues:3

LG_SDG

Language Grounded Single Source Domain Generalization in Medical Image Segmentation [ISBI2024]

Language:Jupyter NotebookStargazers:25Issues:0Issues:0

S3A

repo for paper titled: Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment (AAAI'24 Oral)

Language:Jupyter NotebookStargazers:23Issues:2Issues:0

LWI-VMS

Learnable Weight Initialization for Volumetric Medical Image Segmentation [Elsevier AIM2024]

Language:PythonStargazers:22Issues:0Issues:0

DCViT-AT

Official repository for "Boosting Adversarial Transferability using Dynamic Cues " (ICLR 2023)

Language:PythonLicense:MITStargazers:17Issues:0Issues:0

MedContext

Official code for the paper "MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation"

Language:PythonStargazers:7Issues:0Issues:0