Beast code in Giters

Xiaowei Chi's repositories

MMTrail

[Arxiv 2024] Official code for MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions

Language: HTML

2D-Virtual-Data

BinCopyPaste: Several Clicks to build datasets for instance segmentation in bin-picking scenarios

Language: Python

academicpages.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language: JavaScript | License: MIT

AnimateDiff

Official implementation of AnimateDiff.

Language: Python | License: Apache-2.0

Auto-GPT

An experimental open-source attempt to make GPT-4 fully autonomous.

Language: Python | License: MIT

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language: Python | License: MIT

BEVDepth

Official code for BEVDepth.

Language: Python | License: MIT

detectron2

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

License: Apache-2.0

LaVIT

LaVIT: Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization

License: NOASSERTION

LLaMA-Adapter

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Language: Python | License: GPL-3.0

magvit

Official JAX implementation of MAGVIT: Masked Generative Video Transformer

License: Apache-2.0

magvit2-pytorch

Implementation of MagViT2 Tokenizer in PyTorch

Language: Python | License: MIT

MiniGPT-5

Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"

License: Apache-2.0

MISA

MISA: Modality-Invariant and -Specific Representations for Multimodal Sentiment Analysis

Language: Python | License: MIT

modulated_fusion_transformer

Modulated Fusion using Transformer for Linguistic-Acoustic Emotion Recognition

Language: Python

MOSEI_UMONS

A Transformer-based joint-encoding for Emotion Recognition and Sentiment Analysis

Language: Python | License: MIT

Multimodal-Infomax

Official implementation of the paper "Improving Multimodal Fusion with Hierarchical Mutual Information Maximization for Multimodal Sentiment Analysis", accepted at EMNLP 2021.

Language: Python | License: MIT

QueryRCNN

Language: Python

SFA

Official Implementation of "Exploring Sequence Feature Alignment for Domain Adaptive Detection Transformers"

License: Apache-2.0

Volumetric-Aggregation-Transformer

Official Implementation of VAT

License: MIT

VTK-AR

Language: JavaScript

xiaoweichi.com

Language: HTML

litwellchi

M2Chat

MMTrail

BEV-SAN

2D-Virtual-Data

academicpages.github.io

AnimateDiff

Auto-GPT

baselines

BEVDepth

UTMP

Category-6D-Pose

detectron2

dynamic_grasping

LaVIT

LitTools

litwellchi.github.io

LLaMA-Adapter

llama-illusion

magvit

magvit2-pytorch

MiniGPT-5

MISA

modulated_fusion_transformer

MOSEI_UMONS

Multimodal-Infomax

QueryRCNN

SFA

Volumetric-Aggregation-Transformer

VTK-AR

xiaoweichi.com