There are 4 repositories under wikipedia-scraper topic.
Python wrapper for Wikipedia
Web scraping, data parsing and automation tutorials. Suited for both beginners and intermediate/advanced programmers.
Python wrapper for the MediaWiki API to access and parse data from Wikipedia
Java tool to get wikipedia data
A :robot: which provides features from Wikipedia like summary, title searches, location API etc.
Graphically display the connections between different Wikipedia articles
A complete Python text analytics package that allows users to search for a Wikipedia article, scrape it, conduct basic text analytics and integrate it to a data pipeline without writing excessive code.
SpaceX Launches 🚀 and Starlink Satellites 🛰
Collects a multimodal dataset of Wikipedia articles and their images
Music tagger with GUI that parses wikipedia for information. Can also download album art and lyrics.
This project collects Wikipedia articles from a search term entered by the user and formats the data into a .docx (Word Document) document with images related to each section of the collected article.
A NLP algorithm I developed to determine the similarity or relation between two documents/Wikipedia articles. Inspired by the cosine similarity algorithm and built from WordNet.
Wikipedia Article Summarizer a simple Python project based on NLP techniques
A tutorial and code samples of web scraping with PHP
Taxonomic trees (cladograms) from Wikipedia-scraped data.
Extracts geodata from a wikipedia dump
Linked Data Knowledge Base Population (KBP) framework built on top of Snorkel. The default configuration uses Wikipedia as text corpus and DBpedia as target.
Wikipedia Scraper written in PHP
A minimally dependent Wikimedia CLI
Scraping Wikipedia using the python wrapper of Wikipedia's WikiMedia API
Given a topic name this project finds you the most suitable parent topics for the topic, searching the Wikipedia Category Network (WCN) related to that topic by the help of a statistical approach. It helps you form an autogenerated topic tree for a given topic.
A Wikipedia Web Scraper used to download all the text information in a .txt file.
Scraping logos of world football clubs from wikipedia
🌍 Wikipedia Title Translator is a Chrome extension that displays the translation of any Wikipedia article title in your selected language.
This repository contains the query and processing code to support the publication "Wikipedia curation and the US-EPA CompTox Chemicals Dashboard."
This is a Python - based application that allows the user to search for information and open URLs.
A tutorial for web scraping with R
Web Scraping Wikipedia for Disney Movies to create a Disney Movies dataset and then cleaning the data to perform further Data Analysis using the cleaned JSON
Website with interactive game, where you have to travel from random page on Wikipedia to Adolf Hitler's page (or any page specified by you in settings).
Scrape List of American films of 2022 from wikipedia
Scrapes data from Wikipedia and generates a Graph based on inputs