SeonjeongHwang / ezTag

Web interface that allows users to perform computer-assisted text annotation

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Tagging biomedical concepts via interactive learning

ezTag is a web-based concept tagging tool that allows users to manually annotate text with biomedical concepts, use annotated data to train models, and use trained models to tag text automatically. Because ezTag is interactive, the automatically tagged text can then be refined manually to create new annotated data for training an improved model.

In ezTag, users can upload documents in BioC format, including PubMed abstracts and PubMed Central full-text articles. Biomedical concepts (biomedical named entities and their concept IDs) can then be annotated with one of several automated tools:

  • State-of-the-art entity tagging tools such as TaggerOne, GNormPlus and tmVar

  • Our string match algorithm, using a user-provided lexicon

  • Customized tagging models (TaggerOne) trained on a set of annotated documents (i.e. a collection).

NOTE: This repository contains the source code of the ezTag web interface (concept tagging tools excluded).

About

Web interface that allows users to perform computer-assisted text annotation


Languages

Language:JavaScript 41.6%Language:Ruby 31.7%Language:HTML 20.4%Language:CSS 5.9%Language:CoffeeScript 0.4%