khangt1k25 / EPLcrawler

EPL project

Getting started with scrapy crawler

  • Requirements:
    • Python
    • Scrapy
  • Setup:
    • Git clone the repo
    • cd into the repo
  1. PlayerOfClub
scrapy crawl playerofclub -o playerofclub_2122.json

To crawl another season, change compSeasons in the URL in PlayerOfClub.py to that season's id, and change the output file in the command to playerofclub_xxyy.json. [Please read seasonid to get the id]
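The manual edit described above can be sketched in Python. This is only an illustration under stated assumptions: the example URL, the id value, and the helper names `with_season` and `output_name` are hypothetical and are not taken from PlayerOfClub.py; the real season ids come from seasonid.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def with_season(url, season_id):
    """Return url with its compSeasons query parameter set to season_id."""
    parts = urlsplit(url)
    pairs = parse_qsl(parts.query, keep_blank_values=True)
    # Replace the value in place so the other parameters keep their order.
    updated = [
        (key, str(season_id)) if key == "compSeasons" else (key, value)
        for key, value in pairs
    ]
    # If the URL had no compSeasons parameter at all, add one.
    if not any(key == "compSeasons" for key, _ in pairs):
        updated.append(("compSeasons", str(season_id)))
    return urlunsplit(parts._replace(query=urlencode(updated)))


def output_name(prefix, start_year):
    """Build an xxyy-style file name, e.g. 2021 -> prefix_2122.json."""
    xx = start_year % 100
    yy = (start_year + 1) % 100
    return f"{prefix}_{xx:02d}{yy:02d}.json"


if __name__ == "__main__":
    # Hypothetical URL and id, for illustration only.
    example = "https://example.com/clubs?pageSize=30&compSeasons=1&page=0"
    print(with_season(example, 2))
    print(output_name("playerofclub", 2021))
```

The resulting file name is what would be passed to `-o` when running the spider for that season.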

  2. Player
scrapy crawl playerofclub -o player_2122.json

To crawl another season, change compSeasons in the URL in PlayerOfClub.py to that season's id, and change the output file in the command to player_xxyy.json. [Please read seasonid to get the id]

  3. Ranking
scrapy crawl rank -o rank.json

This spider only needs to run once: it crawls the ranking tables of all seasons.

  4. Stats for clubs in specific seasons
scrapy crawl clubstats -o clubstats_2021.json
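Each command above writes its items to a .json file, which Scrapy's JSON feed export stores as a single JSON array of objects. A minimal sketch for loading such a file before analysis follows; the helper name `load_items` is hypothetical, and no item fields are assumed because the spiders' fields are not listed here.

```python
import json
import sys


def load_items(path):
    """Load a Scrapy JSON feed (one JSON array of item objects) as a list."""
    with open(path, encoding="utf-8") as handle:
        items = json.load(handle)
    if not isinstance(items, list):
        raise ValueError(f"{path} does not contain a JSON array")
    return items


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage: pass a crawl output file, e.g. clubstats_2021.json
    print(f"{len(load_items(sys.argv[1]))} items loaded")
```

If loading fails with a JSON decode error, one possible cause is that the same output file was reused across several runs, since `-o` appends to an existing file.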

Analysis report

Report
