RPMAN666's starred repositories

3D-ResNets-PyTorch

3D ResNets for Action Recognition (CVPR 2018)

Language:PythonLicense:MITStargazers:3867Issues:58Issues:269

Vim

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Language:PythonLicense:Apache-2.0Stargazers:2786Issues:30Issues:107

InternVideo

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Language:PythonLicense:Apache-2.0Stargazers:1289Issues:28Issues:159

xlstm

Official repository of the xLSTM.

Language:PythonLicense:AGPL-3.0Stargazers:1211Issues:13Issues:43

awesome-multimodal-in-medical-imaging

A collection of resources on applications of multi-modal learning in medical imaging.

CVPR2023-DMVFN

CVPR2023 (highlight) - A Dynamic Multi-Scale Voxel Flow Network for Video Prediction

Language:Jupyter NotebookLicense:MITStargazers:332Issues:7Issues:15

CEN

[TPAMI 2023, NeurIPS 2020] Code release for "Deep Multimodal Fusion by Channel Exchanging"

Language:PythonLicense:MITStargazers:284Issues:6Issues:17

ChineseResearchLaTeX

**科研常用LaTeX模板集

Language:TeXLicense:MITStargazers:265Issues:7Issues:7

LoGoNet

[CVPR2023] LoGoNet: Towards Accurate 3D Object Detection with Local-to-Global Cross-Modal Fusion

EfficientFace

[AAAI'21] Robust Lightweight Facial Expression Recognition Network with Label Distribution Training

Language:PythonLicense:MITStargazers:183Issues:1Issues:29

durian-pytorch

Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.

Language:PythonLicense:BSD-3-ClauseStargazers:182Issues:8Issues:10

MMSA-FET

A Tool for extracting multimodal features from videos.

Language:PythonLicense:GPL-3.0Stargazers:127Issues:6Issues:38

CarTeller

汽车识别(包括车牌、车型、车品牌、属性、及驾驶员违规行为识别检测)

once_power

🛠 A tool based on Flutter for bulk renaming files and the ability to remove useless nested folders

Language:DartLicense:GPL-2.0Stargazers:113Issues:2Issues:4

multimodal-emotion-recognition

This repository provides implementation for the paper "Self-attention fusion for audiovisual emotion recognition with incomplete data".

Language:PythonLicense:MITStargazers:96Issues:4Issues:21

multimodal-image-fusion-to-detect-brain-tumors

Multi-modal medical image fusion to detect brain tumors using MRI and CT images

Language:Jupyter NotebookLicense:MITStargazers:87Issues:4Issues:2

DMD

An official implementation of "Decoupled Multimodal Distilling for Emotion Recognition" in PyTorch. (CVPR 2023 highlight)

Language:PythonLicense:MITStargazers:87Issues:6Issues:15

MSAF

Offical implementation of paper "MSAF: Multimodal Split Attention Fusion"

Language:PythonLicense:MITStargazers:73Issues:4Issues:11

VAANet

[AAAI 2020] Official implementation of VAANet for Emotion Recognition

Multimodal-action-recognition

Code on selecting an action based on multimodal inputs. Here in this case inputs are voice and text.

CoMPM

Context Modeling with Speaker's Pre-trained Memory Tracking for Emotion Recognition in Conversation (NAACL 2022)

CVPR2023_Top_Open_Papers

This repository is a curated collection of the most exciting and influential CVPR 2023 opensource works [Paper + Code].🔥

Language:HTMLStargazers:59Issues:3Issues:0

EmoTx-CVPR2023

[CVPR 2023] Official code repository for "How you feelin'? Learning Emotions and Mental States in Movie Scenes". https://arxiv.org/abs/2304.05634

TIM

Codebase for the paper: "TIM: A Time Interval Machine for Audio-Visual Action Recognition"

CTEN

[CVPR 2023] This is the official implementation of "Weakly Supervised Video Emotion Detection and Prediction via Cross-Modal Temporal Erasing Network"

PS-Mixer

PS-Mixer: A Polar-Vector and Strength-Vector Mixer Model for Multimodal Sentiment Analysis

Language:PythonLicense:MITStargazers:29Issues:1Issues:3

MMANet-CVPR2023

offical code for MMANet: Margin-aware Distillation and Modality-aware Regularization for Incomplete Multimodal Learning

Language:PythonLicense:MITStargazers:29Issues:3Issues:6

conclugen

Official repository for our CVPR 2024 Workshop paper "Multi-Task Multi-Modal Self-Supervised Learning for Facial Expression Recognition".

Language:PythonStargazers:12Issues:0Issues:0
Language:Jupyter NotebookStargazers:1Issues:0Issues:0