wikiwho / WikiWho

An algorithm to compute token-level provenance and changes for revisioned wiki content. Tested at over 95% accuracy on the English Wikipedia.

Home Page: http://f-squared.org/wikiwho

WikiWho

An algorithm to identify the revision origin of - and all changes ever applied to - the tokens of a revisioned Wiki document. This enables, e.g., detecting authorship and editor interactions.
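
To make "revision origin of a token" concrete, here is a minimal sketch. It is not the WikiWho algorithm: it is a naive illustration that diffs consecutive revisions with Python's standard `difflib` and carries each surviving token's origin forward. The function name `attribute_tokens`, the lowercase whitespace tokenization, and the toy revisions are all assumptions made for the example.

```python
from difflib import SequenceMatcher


def attribute_tokens(revisions):
    """Return [(token, origin_rev_id), ...] for the last revision.

    `revisions` is an ordered list of (rev_id, text) pairs, oldest first.
    Naive illustration only; WikiWho itself is more elaborate.
    """
    current = []  # (token, origin_rev_id) pairs for the latest processed revision
    for rev_id, text in revisions:
        new_tokens = text.lower().split()  # assumed tokenization: whitespace
        old_tokens = [token for token, _ in current]
        matcher = SequenceMatcher(a=old_tokens, b=new_tokens, autojunk=False)
        updated = []
        for tag, i1, i2, j1, j2 in matcher.get_opcodes():
            if tag == "equal":
                # Unchanged tokens keep the revision in which they first appeared.
                updated.extend(current[i1:i2])
            elif tag in ("replace", "insert"):
                # Newly written tokens originate in the current revision.
                updated.extend((token, rev_id) for token in new_tokens[j1:j2])
            # "delete": removed tokens are simply not carried forward.
        current = updated
    return current


toy_revisions = [
    (1, "the cat sat"),
    (2, "the black cat sat"),
    (3, "the black cat slept"),
]
print(attribute_tokens(toy_revisions))
# [('the', 1), ('black', 2), ('cat', 1), ('slept', 3)]
```

A plain diff like this cannot tell that a token deleted in one revision and restored later is the same token, which is one reason a dedicated algorithm is needed for real articles.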

Also check out the WikiWho API for current data from Wikipedia.

Requirements and Installation

WikiWho runs on both Python 2 and Python 3.

The `requests` package is required to get revision metadata and text from the Wikipedia API.
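
As a rough sketch of what fetching revision metadata and text with `requests` can look like, the example below queries the public MediaWiki API directly. It does not show WikiWho's own code: the helper names `build_revision_query` and `fetch_revisions`, the User-Agent string, and the choice of fields are assumptions for illustration.

```python
API_URL = "https://en.wikipedia.org/w/api.php"


def build_revision_query(title, limit=5):
    """Build MediaWiki API parameters for the oldest `limit` revisions of a page."""
    return {
        "action": "query",
        "format": "json",
        "prop": "revisions",
        "titles": title,
        "rvprop": "ids|timestamp|user|content",
        "rvslots": "main",
        "rvdir": "newer",  # oldest revisions first
        "rvlimit": limit,
    }


def fetch_revisions(title, limit=5):
    """Return a list of (rev_id, user, text) tuples for `title`."""
    # Imported here so the query builder works without requests installed.
    import requests

    response = requests.get(
        API_URL,
        params=build_revision_query(title, limit),
        headers={"User-Agent": "wikiwho-readme-example/0.1"},  # hypothetical value
        timeout=30,
    )
    response.raise_for_status()
    pages = response.json()["query"]["pages"]
    revisions = []
    for page in pages.values():
        for rev in page.get("revisions", []):
            # User or text can be absent when a revision is suppressed.
            text = rev.get("slots", {}).get("main", {}).get("*")
            revisions.append((rev["revid"], rev.get("user"), text))
    return revisions
```

Calling `fetch_revisions("Example")` needs network access. For long revision histories, the API returns a `continue` token that must be passed back in follow-up requests, which this sketch leaves out.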

The `mwxml` package is required to get revision metadata and text from XML dumps. This package runs only on Python 3.

Install the WikiWho package using `pip`:

pip install WikiWho

Running WikiWho

See the example scripts under WikiWho/WikiWho/examples for how to run WikiWho.

Contact

  • Fabian Floeck: fabian.floeck[.]gesis.org
  • Maribel Acosta: maribel.acosta[.]kit.edu
  • Kenan Erdogan: kenan.erdogan[.]gesis.org

License

This work is licensed under the MIT License.

Developed at Karlsruhe Institute of Technology and GESIS - Leibniz Institute for the Social Sciences
