spyysalo / genia-pos

GENIA corpus v3.02 part-of-speech annotations (GENIA tagger variant)

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

GENIA corpus v3.02 POS annotations (GENIA tagger variant)

This repository contains the version of the GENIA corpus part-of-speech annotations that were used to train the GENIA tagger and in the related experiments of Tsuruoka et al. (2005).

This data is made available separately for reference as the GENIA corpus v3.02 POS annotation does not specify a train/test split and differs from this version in some aspects of its formatting.

Thanks to Yoshimasa Tsuruoka for providing this data and agreeing to make it available.

References

  • Yoshimasa Tsuruoka, Yuka Tateishi, Jin-Dong Kim, Tomoko Ohta, John McNaught, Sophia Ananiadou, and Jun'ichi Tsujii, Developing a Robust Part-of-Speech Tagger for Biomedical Text, Advances in Informatics - 10th Panhellenic Conference on Informatics, LNCS 3746, pp. 382-392, 2005

About

GENIA corpus v3.02 part-of-speech annotations (GENIA tagger variant)

License:Other