Fuxiao Liu (FuxiaoLiu)

Home Page: https://fuxiaoliu.github.io

Twitter: @FuxiaoL

Fuxiao Liu's repositories

LRV-Instruction

[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning

Language: Python, License: BSD-3-Clause, Stargazers: 255, Issues: 11, Issues: 23

VisualNews-Repository

[EMNLP'21] Visual News: Benchmark and Challenges in News Image Captioning

Language: Jupyter Notebook, Stargazers: 87, Issues: 14, Issues: 4

MMC

[NAACL 2024] MMC: Advancing Multimodal Chart Understanding with LLM Instruction Tuning

DocumentCLIP

[ICPRAI 2024] DocumentCLIP: Linking Figures and Main Body Text in Reflowed Documents

Language: Python, License: NOASSERTION, Stargazers: 16, Issues: 5, Issues: 0

Twitter-Video-dataset

[EACL'23] COVID-VTS: Fact Extraction and Verification on Short Video Platforms

HallusionBench

[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

awesome-Large-MultiModal-Hallucination

😎 up-to-date & curated list of awesome LMM hallucinations papers, methods & resources.

Stargazers: 2, Issues: 0, Issues: 0

EAGLE

EAGLE: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders

License: Apache-2.0, Stargazers: 1, Issues: 0, Issues: 0

Awesome-Multimodal-Large-Language-Models

✨✨ Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.

Stargazers: 0, Issues: 0, Issues: 0
Language: Jupyter Notebook, License: MIT, Stargazers: 0, Issues: 0, Issues: 0

calvinliu123.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language: JavaScript, License: MIT, Stargazers: 0, Issues: 0, Issues: 0
Stargazers: 0, Issues: 1, Issues: 0
Stargazers: 0, Issues: 1, Issues: 0
Language: HTML, Stargazers: 0, Issues: 0, Issues: 0

GoodNews

Good News Everyone! - CVPR 2019

Language: Python, Stargazers: 0, Issues: 0, Issues: 0

LLaVA

Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities.

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0
Language: JavaScript, Stargazers: 0, Issues: 0, Issues: 0

M3Exam

Data and code for paper "M3Exam: A Multilingual, Multimodal, Multilevel Benchmark for Examining Large Language Models"

Language: Python, Stargazers: 0, Issues: 0, Issues: 0

MiniGPT-4

MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models

Language: Python, License: BSD-3-Clause, Stargazers: 0, Issues: 0, Issues: 0
Stargazers: 0, Issues: 0, Issues: 0

open_clip

An open source implementation of CLIP.

Language: Python, License: NOASSERTION, Stargazers: 0, Issues: 0, Issues: 0
Language: Python, Stargazers: 0, Issues: 1, Issues: 0
Stargazers: 0, Issues: 1, Issues: 0
Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 1, Issues: 1

self-instruct

Aligning pretrained language models with instruction data generated by themselves.

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

TCP

[NeurIPS 2022] Trajectory-guided Control Prediction for End-to-end Autonomous Driving: A Simple yet Strong Baseline.

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

tool4ipp

This repository contains a data conversion tool for the Image Position Prediction task proposed in our paper.

License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0
Language: Python, Stargazers: 0, Issues: 0, Issues: 0

VILA

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0