Muzammal-Naseer

Muzammal Naseer's starred repositories

grok-1

Grok open release

Language:PythonApache-2.048570 542 194

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.

Language:PythonCC-BY-4.0981 14 99

MobiLlama

MobiLlama : Small Language Model tailored for edge devices

Language:PythonApache-2.0537 12 11

Awesome-CV-Foundational-Models

420 20 6

GeoChat

[CVPR 2024 🔥] GeoChat, the first grounded Large Vision Language Model for Remote Sensing

Language:Python299 7 37

GiT

Official Implementation of "GiT: Towards Generalist Vision Transformer through Universal Language Interface"

Language:PythonApache-2.0218 6 7

PromptSRC

[ICCV'23 Main Track, WECIA'23 Oral] Official repository of paper titled "Self-regulating Prompts: Foundational Model Adaptation without Forgetting".

Language:PythonMIT190 5 13

Vita-CLIP

Official repository for "Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting" [CVPR 2023]

Language:PythonMIT98 7 10

Clip2Protect

[CVPR 2023] Official repository of paper titled "CLIP2Protect: Protecting Facial Privacy using Text-Guided Makeup via Adversarial Latent Search".

Language:Python96 6 12

Video-FocalNets

Official repository for "Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition" [ICCV 2023]

Language:Python82 6 4

PromptAlign

[NeurIPS 2023] Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization

Language:Python77 3 3

ProText

Official repository of paper titled "Learning to Prompt with Text Only Supervision for Vision-Language Models".

Language:PythonMIT73 3 2

satmae_pp

Official repository for "Rethinking Transformers Pre-training for Multi-Spectral Satellite Imagery" (CVPR 2024)

Language:PythonApache-2.06200

llmblueprint

[ICLR 2024] Official code for the paper "LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts"

Language:Jupyter Notebook53 2 3

FLIP

Official implementation of the paper "FLIP: Cross-domain Face Anti-spoofing with Language Guidance". (ICCV 2023)

Language:Python48 2 5

vafa

[MICCAI 2023] Official code repository of paper titled "Frequency Domain Adversarial Training for Robust Volumetric Medical Segmentation" accepted in MICCAI 2023 conference.

Language:PythonMIT45 2 1

cooperative-foundational-models

Official code for our paper "Enhancing Novel Object Detection via Cooperative Foundational Models"

Language:PythonMIT42 6 6

SegNext

Rethinking Interactive Image Segmentation with Low Latency, High Quality, and Diverse Prompts (CVPR 2024)

Language:PythonMIT3900

PromptCAL

Official Implementation of paper: PromptCAL: Contrastive Affinity Learning via Auxiliary Prompts for Generalized Novel Category Discovery (CVPR'23)

Language:PythonMIT36 4 9

clippy

Perceptual Grouping in Contrastive Vision-Language Models (ICCV'23)

Language:Jupyter NotebookGPL-3.033 3 1

CVRR-Evaluation-Suite

Official repository of paper titled "How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs".

Language:PythonCC-BY-4.03100

ObjectCompose

About Official repository of paper titled "ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes"

Language:Jupyter Notebook2900

composed-video-retrieval

Composed Video Retrieval

Language:PythonApache-2.027 2 3

LG_SDG

Language Grounded Single Source Domain Generalization in Medical Image Segmentation [ISBI2024]

Language:Jupyter Notebook2500

S3A

repo for paper titled: Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment (AAAI'24 Oral)

Language:Jupyter Notebook23 20

LWI-VMS

Learnable Weight Initialization for Volumetric Medical Image Segmentation [Elsevier AIM2024]

Language:Python2200

DCViT-AT

Official repository for "Boosting Adversarial Transferability using Dynamic Cues " (ICLR 2023)

Language:Python19 1 1

HLSS

Language:PythonMIT1700

MedContext

Official code for the paper "MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation"

Language:Python700