SURFZJY / german-wikipedia-text-corpus

This is a German text corpus from Wikipedia. It is cleaned, preprocessed, and split into sentences. Its purpose is to train NLP embeddings such as fastText or ELMo (deep contextualized word representations).
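As a rough illustration of how a sentence-split corpus like this might be fed to an embedding trainer, here is a minimal sketch. It assumes the corpus is a UTF-8 text file with one sentence per line and whitespace-separated tokens; that layout is an assumption, and the function name `iter_sentences` is hypothetical.

```python
from pathlib import Path
from typing import Iterator, List


def iter_sentences(corpus_path: Path) -> Iterator[List[str]]:
    """Yield each non-empty line of the file as a list of whitespace tokens.

    Assumes one sentence per line (an assumption about the corpus layout).
    """
    with corpus_path.open(encoding="utf-8") as handle:
        for line in handle:
            tokens = line.split()
            if tokens:  # skip blank lines
                yield tokens
```

Streaming line by line keeps memory use flat even for a large corpus. The resulting token lists can then be handed to whichever embedding trainer is in use.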

Home Page: https://www.t-systems-onsite.de/impressum

