slatex / sTeX-NER

sTeX-NER

This project is part of a master's thesis whose objective is to identify missing semantic annotations in sTeX documents using machine learning. More specifically, BERT-based transformer language models are fine-tuned on flexiformal sTeX content for a token classification task.

The project can be built into a single drop-in artifact, a Java Archive (JAR), for use with the sTeX-IDE.
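
To illustrate what a token classification output looks like in practice, the following is a minimal sketch of how per-token labels could be merged into candidate annotation spans. It is not taken from this repository: the class `SpanDecoder`, the BIO-style label names `B-TERM`, `I-TERM` and `O`, and the sample sentence are all assumptions made for illustration, and the label scheme actually used by the thesis may differ.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of sTeX-NER: merges BIO-style token labels into spans.
final class SpanDecoder {

    /** A contiguous run of tokens flagged as a candidate annotation. */
    static final class Span {
        final int start;   // index of the first token, inclusive
        final int end;     // index after the last token, exclusive
        final String text; // the span's tokens joined by single spaces

        Span(int start, int end, String text) {
            this.start = start;
            this.end = end;
            this.text = text;
        }
    }

    /** Assumed label scheme: "B-TERM" opens a span, "I-TERM" continues it, "O" is outside. */
    static List<Span> decode(List<String> tokens, List<String> labels) {
        if (tokens.size() != labels.size()) {
            throw new IllegalArgumentException("tokens and labels must have the same length");
        }
        List<Span> spans = new ArrayList<>();
        int start = -1; // -1 means no span is currently open
        // Iterate one step past the end so that a span open at the last token is closed too.
        for (int i = 0; i <= tokens.size(); i++) {
            String label = i < tokens.size() ? labels.get(i) : "O";
            boolean continues = start >= 0 && label.equals("I-TERM");
            if (start >= 0 && !continues) {
                spans.add(new Span(start, i, String.join(" ", tokens.subList(start, i))));
                start = -1;
            }
            if (label.equals("B-TERM")) {
                start = i;
            }
        }
        return spans;
    }

    public static void main(String[] args) {
        List<String> tokens = List.of("every", "abelian", "group", "is", "a", "group");
        List<String> labels = List.of("O", "B-TERM", "I-TERM", "O", "O", "B-TERM");
        for (Span s : decode(tokens, labels)) {
            System.out.println(s.start + ".." + s.end + " " + s.text);
        }
    }
}
```

In this sketch an `I-TERM` label that does not follow an open span is ignored rather than treated as an error, which is one of several reasonable conventions.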

Development

If you open this repository with IntelliJ IDEA, Gradle integration works out of the box.

Otherwise, build the JAR artifact from the command line using:

./gradlew jar

The result will be written to build/libs/.
