Awesome Multimodal Named Entity Recognition
A collection of resources on multimodal named entity recognition.
Content
1.Description
π Markdown Format:
π±: Novel idea
π: The first...
π : State-of-the-Artπ : Novel dataset/modelπ οΌDownstream Tasks
2. Topic Order
-
π Dataset
3. Chronological Order
-
2020
-
2021
- (AAAI 2021) Multi-modal Graph Fusion for Named Entity Recognition with Targeted Visual Guidance [paper]
- (AAAI 2021) RpBERT: A Text-image Relation Propagation-based BERT Model for Multimodal NER [paper] [code]
- (EMNLP 2021) Can images help recognize entities? A study of the role of images for Multimodal NER [paper] [code]
-
2022
-
(CVPR 2022) Flat Multi-modal Interaction Transformer for Named Entity Recognition [paper]
π 1st interpolating FLAT with MNERπ SOTA on Twitter15 with Bert_base_uncased but code is unavailable
-
(NAACL Findings 2022) Good Visual Guidance Makes A Better Extractor: Hierarchical Visual Prefix for Multimodal Entity and Relation Extraction [paper] [code]
π code using refined Twitter15 dataset
-
(WSDM 2022) MAF: A General Matching and Alignment Framework for Multimodal Named Entity Recognition [paper] [code]
-
(SIGIR 2022) Hybrid Transformer with Multi-level Fusion for Multimodal Knowledge Graph Completion [paper] [paper]
π 1st fully Transformer structureπ SOTA on Twitter17 using Bert_base_uncased but only implement on Twitter17
-
(NAACL 2022) ITA: Image-Text Alignments for Multi-Modal Named Entity Recognition [paper] [code]
π Roberta_large as backbone provides powerful improvementsπ± Using OCR ect without directly using images
-
(MM 2022) Query Prior Matters: A MRC Framework for Multimodal Named Entity Recognition [paper]
π± 1st MRC based framework for MNER
-
(SIGIR 2022) Learning from Different text-image Pairs: A Relation-enhanced Graph Convolutional Network for Multimodal NER [paper]
π Trustworthy performance by reimplementation
-
(ICME 2022) CAT-MNER: Multimodal Named Entity Recognition with Knowledge-Refined Cross-Modal Attention [paper]
π SOTA on Twitter15 and Twitter17 with Roberta_largeπ Require 8 V100 GPU
-
(DSAA 2022) PromptMNER: Prompt-Based Entity-Related Visual Clue Extraction and Integration for Multimodal Named Entity Recognition [paper]
π SOTA on Twitter15 and Twitter17 with Roberta_largeπ Require 8 V100 GPUπ± Prompt-based
-
οΌarxiv 2022οΌ Multi-Granularity Cross-Modality Representation Learning for Named Entity Recognition on Social Media [paper] [code]
-
(arxiv) Multi-Granularity Contrastive Knowledge Distillation for Multimodal Named Entity Recognition
-