tsterbak / German-NLP

Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

German-NLP

Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German Awesome

Resources and tools which can be used either off-the-shelf or with minor adjustments and which are currently maintained are primarily chosen for this list. It is deliberately biased in terms of usability and user-friendliness.

Pull requests and suggestions are welcome! See contributing guidelines.

Table of Contents

Corpora

General-purpose

Historical

Specialized

Swiss German

Learner and Error Corpora

Word lists

Lists

Data acquisition

Generic resources

Frameworks

Treebanks

Annotation

Standards

Linguistic processing

Tokenization

Stemming

Lemmatization

Morphological analysis

Normalization

POS-tagging

Syntactical parsing

Named Entity Recognition

Text generation

Industry/Applications

Evaluation

Semantic analysis

Datasets

Word embeddings and senses

Sentiment analysis datasets / polarity clues

Sentiment detection

GermEval

(category to improve)

Discourse

Summarization

Psycholinguistics

Speech NLP

Machine Translation

Parallel corpora

Teaching resources and tutorials

More lists

German

General

Comparable lists

Larger institutional GitHub groups

License

CC-BY

About

Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German