climberger / playground-scrapy

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Demo and playground project for web crawling with Scrapy

Following this tutorial:
https://www.youtube.com/playlist?list=PLhTjy8cBISEqkN-5Ku_kXG4QW33sxQo0t

Create Anaconda environment with the environment.yml: conda env create -f environment.yml

Currently, a docker compose is used to deploy a MySQL instance. The mapped port used in the code is not the default MySQL port.

Issues

  • I had following error message while using mysql connector mysql-connector-python: "ModuleNotFoundError: No module named 'mysql'". Maybe it was caused by using mysql-connector-python version 2.0.4. The error was gone after setting up to version 8.0.16
  • The reason might be that Scrapy spider execution does not use the right Anaconda environment. Even when I configure the right environment for PyCharm, I still have to activate this Anaconda environment when executing "scrapy crawl <spiderName>"

About


Languages

Language:Python 100.0%