Sihan Chen's starred repositories
node-v0.x-archive
Moved to https://github.com/nodejs/node
Awesome-Multimodal-Large-Language-Models
✨✨ Latest Advances on Multimodal Large Language Models
awesome-segment-anything
Tracking and collecting papers/projects/others related to Segment Anything.
RGBD_Semantic_Segmentation_PyTorch
[ECCV 2020] PyTorch Implementation of some RGBD Semantic Segmentation models.
VSUA-Captioning
Code for "Aligning Linguistic Words and Visual Semantic Units for Image Captioning", ACM MM 2019
MultiModal_BigModels_Survey
[MIR-2023-Survey] A continuously updated paper list for multi-modal pre-trained big models
ChatBridge
ChatBridge, an approach to learning a unified multimodal model to interpret, correlate, and reason about various modalities without relying on all combinations of paired data.
OPT_Questioner
Official PyTorch implementation of the paper "Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner"
longterm_datasets
Official repository of the paper "Are current long-term video understanding datasets long-term?", published in CVEU 2023.