Fuxiao Liu's repositories
LRV-Instruction
[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning
VisualNews-Repository
[EMNLP'21] Visual News: Benchmark and Challenges in News Image Captioning
DocumentCLIP
[ICPRAI 2024] DocumentCLIP: Linking Figures and Main Body Text in Reflowed Documents
Twitter-Video-dataset
[EACL'23] COVID-VTS: Fact Extraction and Verification on Short Video Platforms
HallusionBench
[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models
awesome-Large-MultiModal-Hallucination
😎 An up-to-date, curated list of papers, methods, and resources on hallucination in large multimodal models (LMMs).
Awesome-Multimodal-Large-Language-Models
✨✨ Latest papers and datasets on multimodal large language models, and their evaluation.
calvinliu123.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
GoodNews
[CVPR'19] Good News Everyone! Context-Driven Entity-Aware Captioning for News Images
LLaVA
Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities.
M3Exam
Data and code for paper "M3Exam: A Multilingual, Multimodal, Multilevel Benchmark for Examining Large Language Models"
MiniGPT-4
MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models
open_clip
An open source implementation of CLIP.
self-instruct
Aligning pretrained language models with instruction data generated by themselves.
TCP
[NeurIPS 2022] Trajectory-guided Control Prediction for End-to-end Autonomous Driving: A Simple yet Strong Baseline.
tool4ipp
This repository contains a data conversion tool for the Image Position Prediction task proposed in our paper.
VILA
VILA: a multi-image visual language model with training, inference, and evaluation recipes, deployable from cloud to edge (Jetson Orin and laptops).