- 📖 About the Project
- 💻 Getting Started
- 👥 Authors
- 🤝 Contributing
- 🔭 Future Features
- ⭐️ Show your support
- 🙏 Acknowledgements
This study investigates the relationship between editors’ disruptive behavior and regulation policies on English Wikipedia, focusing on the Blocking Policy page1. The study collects and analyzes data from 2004 to 2022 using the Wikipedia API, page statistics, and keyword extraction2. The study tests two hypotheses:
- Evolving policies do not reduce editor blocks and reblocks, and
- Biased treatment and vague punishment policies lead to more disruptive behavior.
The study finds evidence to support both hypotheses and discusses the implications for online governance and policymaking. The study also examines the diversity of language preferences and perspectives among editors and administrators, and suggests future directions for research.
Detailed report on this project is available here.
Python3
,Pandas
,NumPy
,Matplotlib
, andSeaborn
- This project demonstrates an overall evolution in the community standard of Wikipedia.
- Authors of this project have done both qualitative and quantitative analysis of the data to prove or disprove their two hypotheses.
- The project is open access and free to use for everyone.
- The collected data is also stored in a google drive of the authors which can be downloaded by any user.
- This project includes all codes including collection of data, cleaning of data, and visualization of data.
- The notebook also includes the primary analysis by the authors.
- The project report is linked in the project description.
- A requirements.txt file is added in the repository which can be used by a user to directly install all necessary
Python3
packages for this project.
To clone the repository in local environment try following steps.
- A web browser like Google Chrome.
- A code editor like Visual Studio Code with Git.
You can check if Git is installed by running the following command in the terminal.
$ git --version
Likewise for python
and pip
for package installation.
$ python --version
Clone the repository using this link.
In the terminal, go to your file directory and run this command.
$ git clone https://github.com/PrangonGhose/wikipedia-blocking-policies.git
In the terminal, run these commands to get into development.
$ cd wikipedia-blocking-policies
$ pip install -r requirements.txt
The required data folder can be manually downloaded from here and be unzipped manually by the user.
Or it can also be done automatically later while continuing with the project notebook.
Now you can open the wikipedia notebook and continue with the project.
👤 Prangon Ghose
- GitHub: @PrangonGhose
- LinkedIn: Prangon Ghose
👤 Md Abeed Hasan
- LinkedIn: Md Abeed Hasan
👤 Yushu Gong
- LinkedIn: Yushu Gong
👤 Ziqiao Zhang
👤 Ziyi Chen
Contributions, issues, and feature requests are welcome! Add suggestions by opening new issues.
Feel free to check the issues page.
- Utilizing editors' IP addresses sourced from Wikipedia's API can enhance research insights by identifying prevalent violation locations.
- Acquiring gender data of administrators can facilitate exploration of gender diversity and potential biases within the administrative landscape.
- Further investigation into correlations between edit frequency, language preferences, and information disparities across Wikipedia pages in various languages is warranted to deepen understanding of these complex dynamics.
Give a ⭐️ if you like this project!
Authors would like to thank:
- Mentor: Andrea Giraldo Sevilla
- Mentor: Marc Santolini
- Wikipedia API
- XTools