This is a German text corpus from Wikipedia. It has been cleaned, preprocessed, and split into sentences. Its purpose is to train NLP embeddings such as fastText or ELMo (deep contextualized word representations).
Home Page: https://www.t-systems-onsite.de/impressum
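
As a rough illustration of how a sentence-split corpus like this might be fed to an embedding trainer, the sketch below reads text line by line and yields one token list per sentence. It assumes a plain-text layout with one sentence per line and whitespace-separated tokens; that layout, the helper name `iter_tokenized_sentences`, and the two sample sentences are assumptions for illustration and are not confirmed by the corpus description.

```python
import io
from typing import Iterable, Iterator, List


def iter_tokenized_sentences(lines: Iterable[str]) -> Iterator[List[str]]:
    """Yield one list of whitespace-separated tokens per non-empty line.

    Hypothetical helper: assumes one sentence per line.
    """
    for line in lines:
        tokens = line.split()
        if tokens:  # skip blank lines
            yield tokens


# Stand-in for an opened corpus file; both sentences are made up.
sample = io.StringIO(
    "Das ist ein Satz .\n"
    "\n"
    "Berlin ist die Hauptstadt von Deutschland .\n"
)
sentences = list(iter_tokenized_sentences(sample))
print(len(sentences))
```

With a real file, the `io.StringIO` object would be replaced by something like `open(path, encoding="utf-8")`, where `path` points to wherever the corpus was downloaded. An iterable of token lists is the input shape that several embedding libraries accept, while others read the plain-text file directly.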