putssander / filterbubbel-nlp

Retrain state of the art NLP models for Dutch

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

(Re)Training for Dutch

This repo is to keep track of our work to improve on the current NLP tools for Dutch. We are interested in the following features::

  • semantic role labelling
  • co-refence resolution
  • dependency parsing

This is a field of active research, where a lot of progress is made using new machine learning techniques. Much of the research is done internationally and focusses on English. Our aim is to apply some of this work to Dutch.

The machine learning approaches crucially depend on the amount (and quality) of the data. Therefore, we will also inventory available data sets for supervised and unsupervised learning.

Note that:

About

Retrain state of the art NLP models for Dutch


Languages

Language:Python 97.6%Language:Shell 2.4%