philipperemy / japanese-street-addresses-scraper

Scraper for Japanese street addresses (住所).

Japanese Street Addresses Scraper

Scrapes Japanese street addresses (住所) from itp.ne.jp.

Some figures

  • 7,225,873: the potential number of distinct postal addresses listed on the Japanese yellow pages.
  • 12: the number of days it took to retrieve them all, using a VPN with automatic IP switching.
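
The IP auto switching mentioned above can be pictured as a retry loop that asks for a new IP whenever a request fails. The following is only a minimal sketch of that idea, not the code used in this repository: `fetch_with_ip_rotation`, `fetch`, and `switch_ip` are hypothetical names, and the actual download and VPN reconnection calls are passed in as plain callables so that no particular VPN API is assumed.

```python
import time
from typing import Callable, Optional


def fetch_with_ip_rotation(
    url: str,
    fetch: Callable[[str], Optional[str]],   # hypothetical: downloads a page, returns None on failure
    switch_ip: Callable[[], None],            # hypothetical: reconnects the VPN to get a new IP
    max_attempts: int = 3,
    pause_seconds: float = 0.0,
) -> Optional[str]:
    """Call fetch(url); after each failed attempt, switch IP and retry."""
    for attempt in range(1, max_attempts + 1):
        try:
            page = fetch(url)
        except Exception:
            # Treat any error (timeout, blocked request, ...) as a failed attempt.
            page = None
        if page is not None:
            return page
        if attempt < max_attempts:
            # Only rotate the IP when another attempt will follow.
            switch_ip()
            time.sleep(pause_seconds)
    return None
```

In a real run, `fetch` would wrap an HTTP call (for example with requests) and `switch_ip` would wrap whatever the VPN client offers for reconnecting. Keeping both as parameters makes the retry logic easy to test without network access.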

Script Requirements

  • Python 3.5+
  • numpy
  • expressvpn_python - if you plan to use the VPN mode.
  • requests
  • natsort
  • beautifulsoup4
  • unicode_slugify

Usage

git clone https://github.com/philipperemy/japanese-street-addresses-scraper.git
cd japanese-street-addresses-scraper
pip3 install -r requirements.txt
./main.sh # it starts the scraping!

Languages

  • Python 97.5%
  • Shell 2.5%