Xiaowei Chi's repositories

MMTrail

[Arxiv 2024] Official code for MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions

Language:HTMLStargazers:15Issues:0Issues:0
Language:PythonLicense:MITStargazers:6Issues:0Issues:0

2D-Virtual-Data

BinCopyPaste: Several Clicks to build datasets for instance segmentation in bin-picking scenarios

Language:PythonStargazers:0Issues:0Issues:0

academicpages.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language:JavaScriptLicense:MITStargazers:0Issues:0Issues:0

AnimateDiff

Official implementation of AnimateDiff.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Auto-GPT

An experimental open-source attempt to make GPT-4 fully autonomous.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

BEVDepth

Official code for BEVDepth.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

detectron2

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

LaVIT

LaVIT: Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization

License:NOASSERTIONStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0
Language:HTMLStargazers:0Issues:0Issues:0

LLaMA-Adapter

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

magvit

Official JAX implementation of MAGVIT: Masked Generative Video Transformer

License:Apache-2.0Stargazers:0Issues:0Issues:0

magvit2-pytorch

Implementation of MagViT2 Tokenizer in Pytorch

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

MiniGPT-5

Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"

License:Apache-2.0Stargazers:0Issues:0Issues:0

MISA

MISA: Modality-Invariant and -Specific Representations for Multimodal Sentiment Analysis

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

modulated_fusion_transformer

Modulated Fusion using Transformer for Linguistic-Acoustic Emotion Recognition

Language:PythonStargazers:0Issues:0Issues:0

MOSEI_UMONS

A Transformer-based joint-encoding for Emotion Recognition and Sentiment Analysis

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Multimodal-Infomax

This repository contains the official implementation code of the paper Improving Multimodal Fusion with Hierarchical Mutual Information Maximization for Multimodal Sentiment Analysis, accepted at EMNLP 2021.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

SFA

Official Implementation of "Exploring Sequence Feature Alignment for Domain Adaptive Detection Transformers"

License:Apache-2.0Stargazers:0Issues:0Issues:0

Volumetric-Aggregation-Transformer

Official Implementation of VAT

License:MITStargazers:0Issues:0Issues:0
Language:JavaScriptStargazers:0Issues:0Issues:0
Language:HTMLStargazers:0Issues:0Issues:0