jxzhangjhu / awesome-LMM-Hallucination

List of papers on Hallucination in LMM

Large MultiModal Model Hallucination 😵

LMM hallucination refers to instances where an LMM generates content that appears plausible but deviates from or conflicts with the provided image. LMMs tend to rely more on their own parametric knowledge than on the provided visual features, which leads them to guess and produce multimodal hallucinations.

In the MLLM community, we've developed methods for detecting, evaluating, and mitigating hallucinations 👍.


Detecting

  1. FDPO: Detecting and Preventing Hallucinations in Large Vision Language Models, (Gunjal et al. 2023)
  2. HaELM : Evaluation and Analysis of Hallucination in Large Vision-Language Models, (Wang et al. 2023a)
    • An automatic MLLM hallucination detection framework that trains an LLM to detect hallucinations; a minimal sketch of this idea follows the list
  3. HallE-Switch : Rethinking and Controlling Object Existence Hallucinations in Large Vision-Language Models for Detailed Caption, (Zhai et al. 2023)
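The detection methods above share a common pattern: an LLM is prompted (or fine-tuned, as in HaELM) to judge whether a model response is grounded in a reference description of the image. The sketch below only illustrates that general idea; the judge prompt, function names, and the `judge` callable are illustrative assumptions, not any paper's actual pipeline.

```python
# Minimal sketch of LLM-based hallucination detection (the general idea behind
# frameworks such as HaELM, not their actual pipeline). The judge prompt and
# the `judge` callable are illustrative assumptions.
from typing import List

JUDGE_PROMPT = (
    "Reference description of the image:\n{reference}\n\n"
    "Model response:\n{response}\n\n"
    "Does the model response contain content that conflicts with or is not "
    "supported by the reference description? Answer 'yes' or 'no'."
)

def detect_hallucination(reference: str, response: str, judge) -> bool:
    """Return True if the judge model flags the response as hallucinated.

    `judge` is any callable mapping a prompt string to a short text answer,
    e.g. a locally fine-tuned LLM or an API-backed model.
    """
    answer = judge(JUDGE_PROMPT.format(reference=reference, response=response))
    return answer.strip().lower().startswith("yes")

def hallucination_rate(references: List[str], responses: List[str], judge) -> float:
    """Fraction of responses flagged as hallucinated by the judge."""
    flags = [detect_hallucination(r, o, judge) for r, o in zip(references, responses)]
    return sum(flags) / max(len(flags), 1)
```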

Evaluating

  1. POPE: Evaluating Object Hallucination in Large Vision-Language Models, (Li et al. EMNLP 2023)
    • Discriminative Task: Object Existence, 3k * 3 VQA pairs
    • LLM-free; a metric-computation sketch follows this list
  2. HaELM : Evaluation and Analysis of Hallucination in Large Vision-Language Models, (Wang et al. 2023a)
    • Discriminative Task, 1500 VQA pairs
  3. HallusionBench : An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models, (Liu et al. 2023)
    • Image Reasoning Task, 200 VQA pairs
  4. HallE-Switch : Rethinking and Controlling Object Existence Hallucinations in Large Vision-Language Models for Detailed Caption, (Zhai et al. 2023)
    • Discriminative Task: Object Existence
  5. NOPE: Negative Object Presence Evaluation (NOPE) to Measure Object Hallucination in Vision-Language Models, (Lovenia et al.)
  6. Bingo : Holistic Analysis of Hallucination in GPT-4V(ision): Bias and Interference Challenges, (Cui et al.)
  7. FaithScore : Evaluating Hallucinations in Large Vision-Language Models, (Jing et al.)
    • Generative Task: Object Existence, Attribute, Relationship, 180 VQA pairs
    • open-ended, fine-grained evaluation; requires other models to assist the evaluation
  8. AMBER : An LLM-free Multi-dimensional Benchmark for MLLMs Hallucination Evaluation, (Wang et al.)
    • Discriminative Task: Object Existence, Attribute, Relationship
    • Generative Task: Object Existence
    • LLM-free
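Benchmarks such as POPE (item 1) and AMBER's discriminative track (item 8) are LLM-free: each probe is a yes/no question about the image, so evaluation reduces to parsing the answer and computing standard classification metrics. Below is a minimal sketch; the field names and the simple answer-parsing rule are assumptions for illustration, not the benchmarks' official code.

```python
# Minimal sketch of an LLM-free, POPE-style discriminative evaluation:
# each probe is a yes/no question about object existence, and the model's
# free-form answer is mapped to a binary label before computing metrics.
from typing import Dict, List

def parse_yes_no(answer: str) -> bool:
    """Map a free-form answer to True (yes) / False (no)."""
    return answer.strip().lower().startswith("yes")

def evaluate(probes: List[Dict[str, str]]) -> Dict[str, float]:
    """Compute accuracy / precision / recall / F1 over yes-no probes.

    Each probe is a dict like:
        {"label": "yes", "prediction": "Yes, there is a dog in the image."}
    """
    tp = fp = tn = fn = 0
    for probe in probes:
        gold = probe["label"].strip().lower() == "yes"
        pred = parse_yes_no(probe["prediction"])
        if pred and gold:
            tp += 1
        elif pred and not gold:
            fp += 1
        elif not pred and not gold:
            tn += 1
        else:
            fn += 1
    total = tp + fp + tn + fn
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return {
        "accuracy": (tp + tn) / total if total else 0.0,
        "precision": precision,
        "recall": recall,
        "f1": f1,
        "yes_ratio": (tp + fp) / total if total else 0.0,
    }
```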

Mitigating

  1. LRV-Instruction : Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning, (Liu et al.)
    • [dataset] proposes an instruction-tuning dataset that includes both positive and negative samples
    • GAIVE: an evaluation approach that uses GPT-4
  2. LURE : Analyzing and Mitigating Object Hallucination in Large Vision-Language Models, (Zhou et al. 2023b)
    • [revision] trains a revision model to detect and correct hallucinated objects in the base model's response
  3. HallE-Switch : Rethinking and Controlling Object Existence Hallucinations in Large Vision-Language Models for Detailed Caption, (Zhai et al. 2023)
  4. Woodpecker : Hallucination Correction for Multimodal Large Language Models, (Yin et al.)
    • [revision] post-hoc correction
    • requires other pretrained visual models
  5. LLaVA-RLHF : Aligning Large Multimodal Models with Factually Augmented RLHF, (Sun et al.)
    • [RLHF] the first LMM trained with RLHF
    • proposes a new benchmark: MMHal-Bench
  6. Volcano : Mitigating Multimodal Hallucination through Self-Feedback Guided Revision, (Lee et al.)
    • [self-feedback] generates natural language feedback on its own response and uses it to revise that response; a sketch of this loop follows the list
  7. HalluciDoctor: Mitigating Hallucinatory Toxicity in Visual Instruction Data, (Yu et al.)
  8. VCD: Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding
    • [decoding] contrasts output distributions conditioned on the original image vs. a distorted one; a minimal sketch follows this list
  9. HA-DPO: Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization
  10. Mitigating Hallucination in Visual Language Models with Visual Supervision
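For Volcano-style self-feedback (item 6 above), the core loop is: generate an answer, ask the model to critique it against the image, and revise until the critique finds no issues. The sketch below illustrates that loop under assumed prompts and an assumed stopping rule; it is not Volcano's actual implementation, and `model` stands for any (image, prompt) → text callable.

```python
# Minimal sketch of a self-feedback revision loop in the spirit of Volcano:
# the model critiques its own answer in natural language, then revises it.
# Prompts, stopping rule, and `model` interface are illustrative assumptions.
def self_revise(model, image, question, max_rounds: int = 3) -> str:
    answer = model(image, question)
    for _ in range(max_rounds):
        # Ask the model to critique its own answer against the image.
        feedback = model(
            image,
            f"Question: {question}\nAnswer: {answer}\n"
            "Point out anything in the answer that is not supported by the image. "
            "If the answer is fully supported, reply 'no issues'.",
        )
        if "no issues" in feedback.lower():
            break
        # Revise the answer according to the self-generated feedback.
        answer = model(
            image,
            f"Question: {question}\nPrevious answer: {answer}\n"
            f"Feedback: {feedback}\nRewrite the answer so it is faithful to the image.",
        )
    return answer
```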
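For VCD-style visual contrastive decoding (item 8 above), the idea is to contrast, at each decoding step, the logits obtained with the original image against those obtained with a distorted image, down-weighting tokens driven mainly by the language prior. The sketch below is a hedged illustration of that idea; the weighting scheme and plausibility cutoff are assumptions, not the paper's exact formulation.

```python
# Minimal sketch of visual-contrastive-decoding-style logit adjustment: logits
# conditioned on a distorted (e.g. heavily noised) image are subtracted from
# logits conditioned on the original image at each decoding step. The weighting
# and cutoff below are illustrative assumptions.
import torch

def contrastive_logits(
    logits_original: torch.Tensor,   # [vocab] logits given the original image
    logits_distorted: torch.Tensor,  # [vocab] logits given the distorted image
    alpha: float = 1.0,
    plausibility_cutoff: float = 0.1,
) -> torch.Tensor:
    """Combine the two logit vectors for one decoding step."""
    combined = (1 + alpha) * logits_original - alpha * logits_distorted
    # Restrict to tokens that are reasonably probable under the original image,
    # so rare tokens are not promoted purely by the subtraction.
    probs = torch.softmax(logits_original, dim=-1)
    mask = probs >= plausibility_cutoff * probs.max()
    return combined.masked_fill(~mask, float("-inf"))
```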
