Muzammal Naseer's starred repositories
Video-ChatGPT
[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous quantitative evaluation benchmark for video-based conversational models.
Clip2Protect
[CVPR 2023] Official repository of the paper "CLIP2Protect: Protecting Facial Privacy using Text-Guided Makeup via Adversarial Latent Search".
Video-FocalNets
Official repository for "Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition" [ICCV 2023]
PromptAlign
[NeurIPS 2023] Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization
llmblueprint
[ICLR 2024] Official code for the paper "LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts"
cooperative-foundational-models
Official code for our paper "Enhancing Novel Object Detection via Cooperative Foundational Models"
CVRR-Evaluation-Suite
Official repository of the paper "How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs".
ObjectCompose
Official repository of the paper "ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes"
composed-video-retrieval
Composed Video Retrieval
MedContext
Official code for the paper "MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation"