wikipedia-scraper

There are 4 repositories under the wikipedia-scraper topic.

martin-majlis / Wikipedia-API
Python wrapper for Wikipedia
wikipedia wikipedia-api python3 wikipedia-scraper wikipedia-web-crawler
Language:Python 543
oxylabs / web-scraping-tutorials
Web scraping, data parsing and automation tutorials. Suited for both beginners and intermediate/advanced programmers.
csharp curl golang javascript python r-language ruby web-scraping github-python web-proxies wikipedia-scraper
Language:Jupyter Notebook 40
lehinevych / MediaWikiAPI
Python wrapper for the MediaWiki API to access and parse data from Wikipedia
mediawiki-api python3 wikipedia wikipedia-api wikipedia-crawler wikipedia-sc wikipedia-scraper
Language:Python 34
viralvaghela / Jwiki
Java tool to get Wikipedia data
java wikipedia wikipedia-api javawikipeda javatool wikipedia-scraper data-gathering
Language:Java 34
themagicalmammal / wikibot
A 🤖 which provides features from Wikipedia like summary, title searches, location API etc.
wikibot wikipedia wikipedia-scraper python heroku telegram-bot-api webhook chatbot wikipedia-library bot-commands rtdb wiki-library firebase flask mit-license telegram-bot pytelegrambotapi telegram-userbot telegram
Language:Python 26
Louis3797 / wikipedia-graph
Graphically display the connections between different Wikipedia articles
force-directed-graphs react react-three-fiber reactjs three-js threejs typescript wikipedia wikipedia-api wikipedia-scraper
Language:TypeScript 19
kohjiaxuan / Wikipedia-Article-Scraper
A complete Python text analytics package that allows users to search for a Wikipedia article, scrape it, conduct basic text analytics and integrate it to a data pipeline without writing excessive code.
wikipedia-article text-analytics wikipedia-search wikipedia-scraper wikipedia-corpus wikipedia-api wikipedia
Language:Python 17
moesalih / spacex.moesalih.com
SpaceX Launches 🚀 and Starlink Satellites 🛰
spacex mustache firebase wikipedia serverless wikipedia-scraper google-cloud-platform starlink nextjs spacex-launches
Language:JavaScript 16
attogram / justrefs
Just Refs - extract just the references and related topics from any page on the English Wikipedia
wikipedia wikipedia-api wikipedia-viewer wikipedia-scraper data-extraction information-extraction
Language:PHP 15
OlehOnyshchak / pyWikiMM
Collects a multimodal dataset of Wikipedia articles and their images
wikipedia wikipedia-scraper wikipedia-api wikipedia-bot wikipedia-entries wikipedia-dump wikipedia-search wikipedia-viewer wikipedia-corpus wikipedia-page multimodal multimodality multimodal-data multimodal-datasets multimodal-representation multimodal-learning database data-cleaning data-collection data-processing
Language:Python 15
marian-code / wikipedia-music-tags
Music tagger with a GUI that parses Wikipedia for information. Can also download album art and lyrics.
python-3 music-information-retrieval lyrics-fetcher lyrics-search wikipedia-scraper music-tagger music-tagging album-art pyqt5 pyside2 console-application gui pyinstaller
Language:Python 11
ThiagoNelsi / wikipedia-to-document
This project collects Wikipedia articles from a search term entered by the user and formats the data into a .docx (Word) document with images related to each section of the collected article.
wikipedia wikipedia-api wikipedia-scraper microsoft-word microsoft-word-automation ibm-watson ibm google-cloud-platform google-custom-search docx docx-generator algorithmia filipe-deschamps video-maker open-source scraping api automation robot
Language:JavaScript 11
kohjiaxuan / NLP-Model-for-Corpus-Similarity
An NLP algorithm I developed to determine the similarity or relation between two documents/Wikipedia articles. Inspired by the cosine similarity algorithm and built from WordNet.
nltk-similarity corpus-similarity nlp-model nlp nlp-machine-learning cosine-similarity similarity-score wikipedia wikipedia-scraper text-analytics
Language:Python 9
emreYbs / Wikipedia-Article-Summarizer
Wikipedia Article Summarizer, a simple Python project based on NLP techniques
article-summarization wikipedia-scraper nltk-python nltk nlp python3 jupyter-notebook natural-language-processing nlp-machine-learning python jupyter machine-learning summarization
Language:Jupyter Notebook 8
oxylabs / web-scraping-php
A tutorial and code samples of web scraping with PHP
php web-scraping email-scraper email-scraper-with-proxy screen-scraping url-scraper website-crawler wikipedia-scraper
Language:PHP 8
shanedrabing / taxopedia
Taxonomic trees (cladograms) from Wikipedia-scraped data.
taxonomic-trees taxonomy wikipedia wikipedia-scraper cladogram phylogenetics phylogenetic-trees
Language:Python 7
donomii / wikipedia2geojson
Extracts geodata from a Wikipedia dump
wikipedia wikipedia-dump wikipedia-scraper json geodata geojson geotagged-wikipedia-articles geotagging converter conversion mapping
Language:Go 5
lorenzoranucci / sentimantic
Linked Data Knowledge Base Population (KBP) framework built on top of Snorkel. The default configuration uses Wikipedia as text corpus and DBpedia as target.
information-extraction knowledge-base-population knowledge-base-construction linked-data-quality-assessment linked-data distant-supervision weak-supervision weakly-supervised-learning docker wikipedia-scraper relation-extraction natural-language-processing nlp
Language:Python 5
mynlp / wikilex
Wikipedia Entities Lexicon Extractor
entity-extraction wikipedia-scraper wikipedia-database disambiguation lexicon
Language:Python 5
ammarfaizi2 / wikipedia_scraper
Wikipedia Scraper written in PHP
wikipedia wikipedia-bot wikipedia-scraper scraperwiki scarpe grabber grabbing-content php-curl curl
Language:PHP 4
orange-soda / scrapy-wikipedia
Python implementation for scraping historical events from Chinese Wikipedia, with export to PDF via LaTeX
python wikipedia-scraper
Language:TeX 4
ankitssh / Wikipedia-Scraper-Bot
A Wikipedia scraper bot made in Python.
wikipedia-scraper scraper
Language:Python 3
g3ner1c / wikimedia-cli
A minimally dependent Wikimedia CLI
cli python wikimedia wikimedia-api wikipedia wikipedia-cli wikipedia-scraper
Language:Python 3
GeorgeDavila / WikipediaScrapingWikiAPI
Scraping Wikipedia using the python wrapper of Wikipedia's WikiMedia API
wikipedia wikipedia-api wikipedia-scraper scraper nlp nlp-machine-learning
Language:Jupyter Notebook 3
Goutam1511 / WikiFinder
Given a topic name, this project finds the most suitable parent topics for that topic by searching the related Wikipedia Category Network (WCN) with the help of a statistical approach. It helps you form an autogenerated topic tree for a given topic.
wikipedia-scraper wikipedia-api wikipedia wcn statistical html html-css javascript php python css jquery ajax rest-api
Language:JavaScript 3
Harsh-2909 / Wikipedia-Web-Scraper
A Wikipedia web scraper used to download all the text information to a .txt file.
python python3 beautifulsoup beautifulsoup4 webscraper wikipedia-scraper wikipedia webscraping
Language:Python 3
milosmladenovic5 / football_clubs_logo_scraper
Scraping logos of world football clubs from Wikipedia
web-scraping wikipedia-scraper python-web-crawler beautifulsoup
Language:Python 3
MostafaAryan / Wikipedia-Article-Title-Translator
🌍 Wikipedia Title Translator is a Chrome extension that displays the translation of any Wikipedia article title in your selected language.
chrome chrome-extension chrome-extensions javascript js wikipedia wikipedia-scraper
Language:JavaScript 3
MrMSDS / wikipedia-infoboxes
This repository contains the query and processing code to support the publication "Wikipedia curation and the US-EPA CompTox Chemicals Dashboard."
wikipedia wikipedia-api epa comptox inchi-key inchikey smiles chemistry cheminformatics data open-data wikipedia-scraper
Language:Java 3
Omanshu209 / ExploreWiki
This is a Python-based application that allows the user to search for information and open URLs.
kivymd langchain langchain-python python3 search-engine webbrowser wikipedia wikipedia-api wikipedia-scraper
Language:kvlang 3
oxylabs / web-scraping-r
A tutorial for web scraping with R
r-language web-scraping proxy-scraper wikipedia-scraper
Language:R 3
sinjoysaha / Disney-Movies-Wiki-WebScraper
Web Scraping Wikipedia for Disney Movies to create a Disney Movies dataset and then cleaning the data to perform further Data Analysis using the cleaned JSON
beautifulsoup beautifulsoup4 data-cleaning data-science dataset dataset-creation dataset-generation json jupyter jupyter-notebook python web-scraper web-scraping webscraper webscraping wikipedia wikipedia-scraper
Language:Jupyter Notebook 3
VityaSchel / wikipedia-speedrun
Website with an interactive game in which you travel from a random Wikipedia page to Adolf Hitler's page (or any page you specify in settings).
speedrun wikipedia wikipedia-api wikipedia-dump wikipedia-scraper wikipedia-speedrun
Language:HTML 3
bhimrazy / web-scraping-using-python
Scrape the list of American films of 2022 from Wikipedia
python web-scraping beautifulsoup4 wikipedia-scraper
Language:Jupyter Notebook 2
sakinkirti / wikipedia-graph
Scrapes data from Wikipedia and generates a Graph based on inputs
database graph-database neo4j networkx wikipedia-scraper
Language:Python 2
zaataylor / wikiref
A web extension that makes extracting, editing, and exporting Wikipedia references easy!
extensions firefox-webextension json wikipedia wikipedia-scraper
Language:JavaScript 2
