0. What is this?

This is a solution to the EBM-NLP task proposed in this ACL 2018 publication by Benjamin Nye et al.

The method is Named Entity Recognition (NER) with BioELMo + CRF under PyTorch implementation.

1. Preparation

$ git clone https://github.com/iBotamon/ebmnlp.git

$ cd ebmnlp
$ python -m venv .
$ source bin/activate

$ pip install --upgrade pip
$ pip install -r requirements.txt

Instead, you can also download them by running this:

$ bash get_pretrained_models.sh

$ python ebmnlp.py TEXT_FILE_NAME

I-I	Remdesivir
O	in
I-P	adults
I-P	with
I-P	severe
I-P	COVID-19
I-P	:
O	a
O	randomised
O	,
O	double-blind
O	,
O	placebo-controlled
O	,
O	multicentre

$ python ebmnlp.py TEXT_FILE_NAME OUTPUT_FILE_NAME

$ bash run_flask.sh

Prepare EBM-NLP dataset ebm_nlp_1_00.tar.gz from the repository by the authors.
Extract ebm_nlp_1_00.tar.gz in the official directory like this:

- models
- templates
- official
  └ ebm_nlp_1_00
    └ annotations
      └ ..
    └ documents
      └ ..

$ python ebmnlp_bioelmo_crf.py

You can specify CUDA device number like this:

$ python ebmnlp_bioelmo_crf.py --cuda 3