information-extraction large-language-models

CodeIE

This is the official repository for "CodeIE: Large Code Generation Models are Better Few-Shot Information Extractors" (ACL 2023).

If you have any questions or suggestions, please feel free to open issues or email Peng Li (lip21 [at] m.fudan.edu.cn).

Introduction

Our codebase is mainly modified from the UIE and CoCoGen repositories. We obtained and processed the original NER and RE datasets according to UIE's workflow. Then we implemented the inference logic of CodeIE referring to CoCoGen's workflow. Finally we evaluated the model outputs based on UIE's evaluation process.

We sincerely thank these two works and previous works.

Notice

A piece of not-so-good news is that the Codex models are now deprecated by OpenAI, which will have a significant impact on replicating this paper. Some possible solutions include applying for OpenAI's Researcher Access Program or accessing Codex on Azure OpenAI Service.
Since we are using the closed-source APIs of OpenAI, we are unaware of the technical details behind them, such as the specific pre-training corpora used. Therefore, there might be potential issues of data contamination in the evaluation of our paper. (Acknowledgement: Oscar Sainz)

Workflow

Please refer to the workflow.md file.

Citation

If the paper or this repository helps you, please cite our paper:

@inproceedings{DBLP:conf/acl/LiSTYWHQ23,
  author       = {Peng Li and
                  Tianxiang Sun and
                  Qiong Tang and
                  Hang Yan and
                  Yuanbin Wu and
                  Xuanjing Huang and
                  Xipeng Qiu},
  editor       = {Anna Rogers and
                  Jordan L. Boyd{-}Graber and
                  Naoaki Okazaki},
  title        = {CodeIE: Large Code Generation Models are Better Few-Shot Information
                  Extractors},
  booktitle    = {Proceedings of the 61st Annual Meeting of the Association for Computational
                  Linguistics (Volume 1: Long Papers), {ACL} 2023, Toronto, Canada,
                  July 9-14, 2023},
  pages        = {15339--15353},
  publisher    = {Association for Computational Linguistics},
  year         = {2023},
  url          = {https://doi.org/10.18653/v1/2023.acl-long.855},
  doi          = {10.18653/V1/2023.ACL-LONG.855},
  timestamp    = {Fri, 16 Feb 2024 08:27:36 +0100},
  biburl       = {https://dblp.org/rec/conf/acl/LiSTYWHQ23.bib},
  bibsource    = {dblp computer science bibliography, https://dblp.org}
}

About

[ACL 23] CodeIE: Large Code Generation Models are Better Few-Shot Information Extractors

https://artpli.github.io/CodeIE/

information-extraction large-language-models

Languages

Language:Python 95.7%Language:Shell 4.3%