There are 0 repositories under the wikidump-parser topic.
A tool to extract plain (unformatted) multilingual text, redirects, links, and categories from Wikipedia backups (dumps). Designed to prepare clean training data for AI and machine learning software.
A .NET library designed for streamlined reading and handling of ZIM files.
Convert Wikipedia dumps to Neo4j-loadable CSVs, efficiently transforming Wikipedia data for graph database usage.
A repo for my MS Project titled "Fake-news detection".