Gregor Weichbrodt's repositories
wikitable2csv
A web tool to convert Wiki tables to CSV 📈
german-nouns
A list of ~100,000 German nouns and their grammatical properties, compiled from WiktionaryDE as a CSV file. Includes a module to look up the data and parse compound words.
wiktionary-de-parser
Extract data from German Wiktionary XML files.
dewiki-wordrank
Tab-delimited word frequency list compiled from the German Wikipedia
textstelle
Textstelle is a collection of corpora for the creation of bots and other things that generate text 🤖
newscorpus
A Python scraping module that extracts text from articles found in RSS feeds. Uses SQLite as its database.
Theken-Texte
Chrome browser extension that displays the subtitles of the ARD and ZDF media libraries (Mediatheken) in full, providing them as continuous text or in subtitle format (deprecated)
pyrogram
Telegram MTProto API Client Library and Framework in Pure Python for Users and Bots