Multimedia Understanding and Processing's starred repositories

Language:PythonStargazers:18Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:2Issues:0Issues:0
Language:PythonStargazers:7Issues:0Issues:0
Language:PythonLicense:MITStargazers:16Issues:0Issues:0

Group-Contextualization

[CVPR22] Group Contextualization for Video Recognition

Language:PythonLicense:Apache-2.0Stargazers:21Issues:0Issues:0

FTCM

(TCSVT 2023) FTCM: Frequency-Temporal Collaborative Module for Efficient 3D Human Pose Estimation in Video

Language:PythonStargazers:5Issues:0Issues:0

STCFormer

(CVPR2023)3D Human Pose Estimation with Spatio-Temporal Criss-cross Attention

Language:PythonStargazers:73Issues:0Issues:0

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:18766Issues:0Issues:0

adapt-image-models

[ICLR'23] AIM: Adapting Image Models for Efficient Video Action Recognition

Language:PythonLicense:Apache-2.0Stargazers:259Issues:0Issues:0
Language:PythonStargazers:29Issues:0Issues:0
License:MITStargazers:1Issues:0Issues:0

ChatReviewer

ChatReviewer: 使用ChatGPT分析论文优缺点,提出改进建议

Language:PythonLicense:NOASSERTIONStargazers:1258Issues:0Issues:0

SDA

implementation for MM21 paper "Selective dependency aggregation for action classification"

Language:PythonLicense:MITStargazers:4Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:10Issues:0Issues:0
Language:PythonLicense:MITStargazers:7Issues:0Issues:0

SLaK

[ICLR 2023] "More ConvNets in the 2020s: Scaling up Kernels Beyond 51x51 using Sparsity"; [ICML 2023] "Are Large Kernels Better Teachers than Transformers for ConvNets?"

Language:HTMLLicense:MITStargazers:259Issues:0Issues:0
Language:PythonStargazers:68Issues:0Issues:0

Awesome-Visual-Transformer

Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)

Stargazers:3325Issues:0Issues:0

temporal-shift-module

[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding

Language:PythonLicense:MITStargazers:2046Issues:0Issues:0

Coherence-Enhancing-Diffusion-filtering

Coherence-Enhancing Diffusion Filtering is used in completion of interrupted lines or the enhancement of flow-like structures.

Language:MatlabStargazers:4Issues:0Issues:0

SMVH_matlab

This is the matlab code for Stochastic Multiview Hashing

Language:MatlabStargazers:5Issues:0Issues:0

FoodGAN

Generating food images

Language:PythonStargazers:4Issues:0Issues:0