grtushar / google_play_scraper

An updated version of google play scraper with the addition of permission list information.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

google_play_scraper

An updated version of google play scraper with the additional permission list information retrieving capability.

The permission information of an app in the google play store resides inside a modal window. That's why the permission information doesn't exist in the initial html. Also It's not a real link, it does not redirect to another page as classic a tags would (it doesn't have the href attribute anyway). Instead, there is a listener somewhere that opens a popup when the user click on the "View details" under the permission category.Therefore, the google play scraper can't fetch the permission data.

This updated version of google play scraper perform the click event on the "view details" under permission category using selenium and chrome-driver

Usage

To get an idea about the api documentations please follow this link.

And to configure the program to get permission information you need update the variables in the permission_info.py.

Download the chrome-driver and then set the CHROME_DRIVER_PATH variable of permission_info.py file with the path of the chrome-driver.

Sometimes the information doesn't load, so there is a retry mechanism. You can limit the max retry by setting MAX_ATTEMPT variable to your desired retry count.

In addition, to give some time to chrome to load the data of the url the program sleeps for sometime. You can also configure the sleep interval depending on your internet speed by setting the required value TIME_TO_LOAD_DATE_IN_SECOND and TIME_TO_LOAD_MODAL_DATA_IN_SECOND variables inside the permission_info.py file.

About

An updated version of google play scraper with the addition of permission list information.

License:MIT License


Languages

Language:Python 100.0%