w64228013

w64228013

Geek Repo

Github PK Tool:Github PK Tool

w64228013's starred repositories

BorLan

[ICCV2023] Borrowing Knowledge From Pre-trained Language Model: A New Data-efficient Visual Learning Paradigm

Language:PythonLicense:MITStargazers:14Issues:0Issues:0

FGVP

Official Codes for Fine-Grained Visual Prompting, NeurIPS 2023

Language:PythonStargazers:28Issues:0Issues:0

CoOp

Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)

Language:PythonLicense:MITStargazers:1617Issues:0Issues:0

CCIM

[CVPR2023] Context De-confounded Emotion Recognition

Language:PythonLicense:MITStargazers:14Issues:0Issues:0
Language:PythonLicense:MITStargazers:43Issues:0Issues:0

AMC-grounding

[CVPR 2023] Code for "Improving Visual Grounding by Encouraging Consistent Gradient-based Explanations"

Language:Jupyter NotebookLicense:MITStargazers:16Issues:0Issues:0

PromptSRC

[ICCV'23 Main Track, WECIA'23 Oral] Official repository of paper titled "Self-regulating Prompts: Foundational Model Adaptation without Forgetting".

Language:PythonLicense:MITStargazers:206Issues:0Issues:0

multimodal-prompt-learning

[CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".

Language:PythonLicense:MITStargazers:594Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:1715Issues:0Issues:0

EvidentialADA

Official implementation of Evidential Uncertainty Quantification: A Variance-Based Perspective [WACV 2024]

Language:PythonStargazers:11Issues:0Issues:0

LURE

[ICLR 2024] Analyzing and Mitigating Object Hallucination in Large Vision-Language Models

Language:PythonStargazers:121Issues:0Issues:0

TSM

[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding

Language:PythonLicense:Apache-2.0Stargazers:19Issues:0Issues:0

attention_branch_network

Attention Branch Network (CIFAR100, ImageNet models)

Language:PythonLicense:MITStargazers:267Issues:0Issues:0

CAER

My implementation for the paper Context-Aware Emotion Recognition Networks

Language:PythonStargazers:24Issues:0Issues:0

CLIP-self-attention-visualization

Plotting heatmaps with the self-attention of the [CLS] tokens in the last layer.

Language:PythonStargazers:35Issues:0Issues:0
Language:PythonLicense:MITStargazers:43Issues:0Issues:0

MSF-GZSSAR

Official code of the MSF model for GZSSAR (ICIG 2023)

Language:PythonStargazers:11Issues:0Issues:0

RMT

(CVPR2024)RMT: Retentive Networks Meet Vision Transformer

Language:PythonStargazers:261Issues:0Issues:0

STGAT

Skeleton-Based Action Recognition with Local Dynamic Spatial-Temporal Aggregation (Expert Systems with Applications 2023) (Previous name: Spatial Temporal Graph Attention Network for Skeleton-Based Action Recognition)

Language:PythonLicense:NOASSERTIONStargazers:35Issues:0Issues:0

BLIP

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:4525Issues:0Issues:0
Language:PythonStargazers:5Issues:0Issues:0

SkeletonMAE

SkeletonMAE: Graph-based Masked Autoencoder for Skeleton Sequence Pre-training

License:MITStargazers:87Issues:0Issues:0

LLM-scientific-feedback

Can large language models provide useful feedback on research papers? A large-scale empirical analysis.

Language:PythonLicense:CC-BY-4.0Stargazers:483Issues:0Issues:0

pvic

[ICCV'23] Official PyTorch implementation for paper "Exploring Predicate Visual Context in Detecting Human-Object Interactions"

Language:PythonLicense:BSD-3-ClauseStargazers:61Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:5Issues:0Issues:0

GAP

official implementation for Language Supervised Training for Skeleton-based Action Recognition

Language:PythonLicense:Apache-2.0Stargazers:90Issues:0Issues:0
Language:PythonStargazers:6Issues:0Issues:0

FreeU

FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)

License:MITStargazers:1634Issues:0Issues:0

meta-dataset

A dataset of datasets for learning to learn from few examples

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:751Issues:0Issues:0

AnomalyGPT

[AAAI 2024 Oral] AnomalyGPT: Detecting Industrial Anomalies Using Large Vision-Language Models

Language:PythonLicense:NOASSERTIONStargazers:705Issues:0Issues:0