Computational Linguistics and & Text Mining Lab

cltl

Amsterdam

http://www.cltl.nl/

Computational Linguistics and & Text Mining Lab's repositories

EventStoryLine

Materials for the StoryLine extraction task - annotated data, baselines and evaluation scripts, evaluation data.

Language:PythonNOASSERTION36 160

svm_wsd

Word Sense Disambiguation system developed on the DutchSemCor project using Support Vector Machines. The input is plain text, and the output XML

Language:PythonGPL-3.013 140

opinion_miner_deluxe

Opinion miner based of Machine Learning that can be trained on a corpus of KAF/NAF files

Language:Python10 140

BabelfyReimplementation

Reimplementation of Babelfy (http://babelfy.org)

Language:Python900

morphosyntactic_parser_nl

Morphosyntactic parser for Dutch based on the Alpino parser

Language:PythonApache-2.05 10 6

EL-long-tail-phenomena

Systematic study of long tail phenomena in the task of entity linking

Language:Jupyter Notebook4 140

HumanLikeEL

Human-Like Entity Linking using Contextual knowledge

Language:Jupyter Notebook4 160

MoreIsNotAlwaysBetter

Language:Java300

multilingual_factuality

Language:Python300

WordNetMapper

This repo provides the possibility to map between lexical keys | offsets | ilidefs from one wordnet version to the other ["16","17","171","20","21","30"]. It makes use of the index.sense files from WordNet (http://wordnet.princeton.edu/) and the automatically generated mappings between WordNet offsets (http://nlp.lsi.upc.edu/tools/download-map.php)

Language:HTMLNOASSERTION300

GRaSP

200

hpsp

Experiments with hyperspace models for selectional preference

Language:Jupyter Notebook200

LSTM-WSD

Language:Python200

MFS_classifier

This repo contains the scripts to attempt to remove the mfs bias from a WSD system.

Language:PostScriptNOASSERTION200

SemanticOverfitting

Language:PythonNOASSERTION200

EmotionTagger

Uses an emotion tagger to tag text with emotions

Language:Java100

LongTailIdentity

Generating profiles of long tail identities from text

Language:Jupyter NotebookApache-2.0100

LongTailQATask

Language:Jupyter Notebook100

Profiling

Extraction and categorization of world knowledge about people from WIkidata for the sake of creating profiles

Language:Python1 140

relink

RElinking with CONtext - Entity linking module

100

TimeMLEventTrigger

Language:Python100

vua_factuality

Language:PythonApache-2.0100

cltl-magicplace

Annotate NAF-documents with the Newsreader pipeline on Lisa computer (SurfSara)

Language:M4GPL-2.0000

factuality_experimental_environment

Language:Python000

LOTUS

Code of LOTUS, the largest LOD text index, allowing free text access to the LOD Laundromat data collection

000

nwr-triple-api

Queries the KnowledgeStore populated with NewsReader output and represents the result as SEM-RDF or SEM-JSON

Language:Java000

OldBailey

Processing the OldBailey data to create LOD

Language:Java000

OpeNER_corpus

OpeNER corpus : news articles and hotel reviews annotated with opinion expressions, holders, targets and their relations

000

positive-interpretations

Code (Python 3.6) for automatically scoring and classifying positive interpretations generated from negations in OntoNotes

Language:Jupyter NotebookApache-2.0000

Spoken-versus-Written

Code and data for our VarDial 2018 paper on spoken versus written image descriptions

Language:RoffApache-2.0000