The aim of the Wikipedia Administrative Pages Analytics project is to generate datasets, visualizations, and tools for understanding administrative pages across Wikipedia language editions.
This page serves as technical documentation for understanding the code, datasets, and databases generated.
Project page on Meta-wiki:
https://meta.wikimedia.org/wiki/Wikipedia_Administrative_Pages_Analytics
Databases. The code generates several databases containing the relevant data and features for the content gap metrics.
Pages and features database:
https://wapa.wmcloud.org/databases/wikipedia_administrative_pages_analytics_production.db (3.5 GB)
This database contains the mapping of administrative pages for every Wikipedia language edition (pages and their features).
Stats and metrics database:
https://wapa.wmcloud.org/databases/stats_production.db (1 GB)
This database contains the basic metrics and statistics (e.g., selection and averages of metrics).
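A first step after downloading either database is to check which tables and columns it holds. The following is a minimal sketch, not part of the project code. It assumes the .db files are SQLite databases (suggested by the file extension but not stated above), and the helper name describe_tables is hypothetical.

```python
import sqlite3
from pathlib import Path


def describe_tables(db_path):
    """Return {table_name: [column names]} for a local database file.

    Assumption: the file is a SQLite database. It is opened read-only,
    so a wrong path raises an error instead of creating an empty file.
    """
    uri = Path(db_path).resolve().as_uri() + "?mode=ro"
    conn = sqlite3.connect(uri, uri=True)
    try:
        # sqlite_master lists every object stored in the file.
        names = [
            row[0]
            for row in conn.execute(
                "SELECT name FROM sqlite_master "
                "WHERE type = 'table' ORDER BY name"
            )
        ]
        schema = {}
        for name in names:
            # PRAGMA table_info rows are (cid, name, type, notnull, default, pk).
            columns = conn.execute(f'PRAGMA table_info("{name}")').fetchall()
            schema[name] = [col[1] for col in columns]
        return schema
    finally:
        conn.close()
```

For example, calling describe_tables("stats_production.db") on a local copy of the stats and metrics database would return its table names mapped to their column names. The actual table and column names are not documented here, so none are assumed.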
Website with tools and visualizations:
For more information: