Language Machines (LanguageMachines)

Language Machines

LanguageMachines

Geek Repo

NLP Research group at Centre for Language Studies, Radboud University Nijmegen

Location:Nijmegen, The Netherlands

Home Page:http://cls.ru.nl/languagemachines

Github PK Tool:Github PK Tool

Language Machines's repositories

frog

Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl, the Tilburg memory-based learning software package.

Language:C++License:GPL-3.0Stargazers:73Issues:16Issues:102

ucto

Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic preprocessing steps such as changing case that you can all use to make your text suited for further processing such as indexing, part-of-speech tagging, or machine translation. Ucto comes with tokenisation rules for several languages and can be easily extended to suit other languages. It has been incorporated for tokenizing Dutch text in Frog, our Dutch morpho-syntactic processor. http://ilk.uvt.nl/ucto --

Language:C++License:GPL-3.0Stargazers:60Issues:13Issues:91

PICCL

A set of workflows for corpus building through OCR, post-correction and normalisation

Language:PythonLicense:NOASSERTIONStargazers:46Issues:8Issues:64

timbl

TiMBL implements several memory-based learning algorithms.

Language:C++License:GPL-3.0Stargazers:45Issues:8Issues:12

libfolia

FoLiA library for C++

Language:C++License:GPL-3.0Stargazers:14Issues:10Issues:55

ticcltools

Tools for TICCL

Language:C++License:GPL-3.0Stargazers:13Issues:8Issues:46

mbt

MBT: Memory-based tagger generation and tagging MBT is a memory-based tagger-generator and tagger in one.

Language:C++License:GPL-3.0Stargazers:9Issues:9Issues:7

uctodata

Datafiles for the tokenizer ucto.

Language:ShellLicense:GPL-3.0Stargazers:9Issues:7Issues:9

ticcutils

Ticcutils, a generic utility library shared by our software.

Language:C++License:GPL-3.0Stargazers:6Issues:10Issues:26

wopr

Memory Based Word Predictor/Language Model http://ilk.uvt.nl/wopr/

Language:C++License:NOASSERTIONStargazers:5Issues:7Issues:3

foliautils

Command-line utilities for working with the Format for Linguistic Annotation (FoLiA), powered by libfolia (C++), written by Ko van der Sloot (CLST, Radboud University)

Language:C++License:GPL-3.0Stargazers:4Issues:9Issues:71

timblserver

TiMBL implements several memory-based learning algorithms. This is the server part.

Language:C++License:GPL-3.0Stargazers:3Issues:7Issues:3

dimbl

Distributed Tilburg Memory Based Learner

Language:C++License:GPL-3.0Stargazers:2Issues:19Issues:0

dialect2keywords

Webinterface designed to convert words in Dutch dialects ("dialectopgaven") into standard Dutch keywords ("vernederlandste trefwoorden").

Language:PythonStargazers:1Issues:6Issues:0

frogdata

Data for Frog, mandatory

Language:LexLicense:GPL-3.0Stargazers:1Issues:9Issues:6
Language:C++License:GPL-3.0Stargazers:1Issues:7Issues:3
Language:PythonStargazers:1Issues:18Issues:0

toad

Toad: Trainer Of All Data, the Frog training collection

Language:C++License:GPL-3.0Stargazers:1Issues:21Issues:4
Language:CSSStargazers:0Issues:8Issues:1

bioport

Scrape pages about persons ('biographies') from Wikipedia.

Language:PythonStargazers:0Issues:6Issues:1

clariah-plus-tasks

An overview of CLARIAH-PLUS tasks at CLST, Radboud University, Nijmegen

Language:MakefileStargazers:0Issues:4Issues:0

foliatest

Test suite for libfolia

Language:C++License:GPL-3.0Stargazers:0Issues:7Issues:1

frogtests

Unit tests for Frog

Language:LexStargazers:0Issues:7Issues:1

JASMIN-BLISS-Negation

Documentation of a corpus sample of Dutch human-computer dialogues annotated with negation cues.

Stargazers:0Issues:4Issues:0

json

JSON for Modern C++

Language:C++License:MITStargazers:0Issues:2Issues:0
Language:ShellStargazers:0Issues:5Issues:1

mbttests

Unit tests for Mbt

Language:LexStargazers:0Issues:17Issues:0
Language:PythonStargazers:0Issues:5Issues:0

timbltests

Unit tests for Timbl

Language:EuphoriaStargazers:0Issues:16Issues:0

travistest

small program to test travis issues. Like OSX and Clang OpenMP support

Language:M4Stargazers:0Issues:3Issues:0