Beast code in Giters

0wj0's starred repositories

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

11972 270 109

Book4_Power-of-Matrix

Book_4_《矩阵力量》 | 鸢尾花书：从加减乘除到机器学习；上架！

Language:Python8615 73 161

nebuly

The user analytics platform for LLMs

Language:PythonApache-2.08365 93 202

Video-LLaMA

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Language:PythonBSD-3-Clause2729 32 156

HITSZ-OpenCS

哈尔滨工业大学（深圳）计算机专业课程攻略 | Guidance for courses in Department of Computer Science, Harbin Institute of Technology (Shenzhen)

Language:C1536 28 9

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.

Language:PythonCC-BY-4.01169 14 119

Pytorch-Memory-Utils

pytorch memory track code

Language:Python997 16 22

MovieChat

[CVPR 2024] 🎬💭 chat with over 10K frames of video!

Language:PythonBSD-3-Clause496 10 75

pytorch_graph-rel

A PyTorch implementation of GraphRel

Language:PythonMIT268 6 31

LM4VisualEncoding

[ICLR 2024 (Spotlight)] "Frozen Transformers in Language Models are Effective Visual Encoder Layers"

Language:PythonMIT218 4 9

MISA

MISA: Modality-Invariant and -Specific Representations for Multimodal Sentiment Analysis

Language:PythonMIT192 50

language_modeling_via_stochastic_processes

Language modeling via stochastic processes. Oral @ ICLR 2022.

Language:Python134 7 15

AdaShare

AdaShare: Learning What To Share For Efficient Deep Multi-Task Learning

Language:Python110 5 20

ACM-GNN

NeurIPS 2022, Revisiting Heterophily For Graph Neural Networks, official PyTorch implementation for Adaptive Channel Mixing (ACM) GNN framework

Language:PythonMIT68 70

c-sts

[EMNLP 2023] C-STS: Conditional Semantic Textual Similarity

Language:Python66 4 6

GCNet

GCNet, official pytorch implementation of our paper "GCNet: Graph Completion Network for Incomplete Multimodal Learning in Conversation"

Language:Python63 5 3

MEmoR

Code and dataset of "MEmoR: A Dataset for Multimodal Emotion Reasoning in Videos" in MM'20.

Language:Python50 1 4

Emotion-Recognition-in-Conversations

User Emotion Recognition and Response Generation in Dialogue Text

Language:Python37 1 5

tdlm

实现了Transformer中的几种位置编码方案

Language:Python36 20

MECPE

[TAFFC 2022] Multimodal Emotion-Cause Pair Extraction in Conversations

Language:Python32 3 8

MultiEMO-ACL2023

MultiEMO: An Attention-Based Correlation-Aware Multimodal Fusion Framework for Emotion Recognition in Conversations (ACL 2023)

Language:Python30 3 3

UniS-MMC

Code for UniS-MMC: Multimodal Classification via Unimodality-supervised Multimodal Contrastive Learning (ACL 2023)

Language:PythonMIT29 3 7

VSP

Language:Python21 3 2

Color4Dial

Code and data for "Dialogue Planning via Brownian Bridge Stochastic Process for Goal-directed Proactive Dialogue" (ACL Findings 2023).

Language:PythonMIT21 2 4

MMSD2.0

[ACL2023] Code and dataset for paper "MMSD2.0: Towards a Reliable Multi-modal Sarcasm Detection System"

Language:Python21 1 3

masters-of-our-EMNLP2023-papers

Pytorch code for EMNLP 2023 accepted-main paper "How to Enhance Causal Discrimination of Utterances: A Case on Affective Reasoning" and paper "Learning a Structural Causal Model for Intuition Reasoning in Conversation" (TKDE)

Language:PythonApache-2.01400

VSTAR

[ACL2023] VSTAR is a multimodal dialogue dataset with scene and topic transition information

Language:Python12 10

M3GAT

Language:Python7 10

MEMEX_Meme_Evidence

Official repo for ACL'23 (main) paper - MEMEX: Detecting Explanatory Evidence for Memes via Knowledge-Enriched Contextualization

Language:Python7 3 3

ECPEC

Code for JointEC model

Language:Python300