vlandeiro / LoadGamesFromPcGamesDownloadCom

oadGamesFromPcGamesDownloadCom is written and tested in Python 3.6.5. Its purpose is to load all posts in a structured way from https://pcgames-download.com/.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

LoadGamesFromPcGamesDownloadCom

LoadGamesFromPcGamesDownloadCom is written and tested in Python 3.7.1. Its purpose is to load all posts in a structured way from https://pcgames-download.com/.

Build status GitHub issues GitHub forks GitHub stars GitHub license

How does it work:

The programm scrapes the last page index from the website https://pcgames-download.com/ and iterates over all available pages to collect all posts. The data is saved as a CSV file under the savedData subfolder).

Staged data / raw data will be loaded to the loading subfolder and overwritten.

The programm doesn't use threading because the website otherwise might block the IP address of the scraper in respect to DDoS attacks.

Loading the data might take a while ⏳, so have a ☕ or two ☕ ☕ or do something else in parallel: 📱, ☎️, 📺, 💻, 🚽

How do you start the project:

python LoadGames.py

What else do you need:

BeautifulSoup 4

The result:

The result is a .csv file for the current status of the games: 2018-05-13T19-02-26-141897.csv.

Change history

  • Version 1.0.0.0 (2018-05-13) : 1.0 release.

About

oadGamesFromPcGamesDownloadCom is written and tested in Python 3.6.5. Its purpose is to load all posts in a structured way from https://pcgames-download.com/.

License:GNU General Public License v3.0


Languages

Language:HTML 100.0%Language:Python 0.0%