MariyaSha / WebscrapingInstagram

Multiple Notebooks for Web Scraping Instagram with Selenium

WebscrapingInstagram with Selenium


This repository contains a collection of notebooks related to Instagram web scraping and automation.

CURRENT VERSION (THUMBNAIL EXTRACTION)

Please refer to WebscrapingInstagram_completeUpdated_DEC2022.ipynb, generated and tested on December 22nd, 2022.
This file showcases the updated Selenium commands, which have changed since I filmed my YouTube tutorial.
FYI, in the new version of Selenium, commands with this syntax:

driver.find_elements_by_tag_name("input")

were replaced with commands with this syntax:

driver.find_elements(By.TAG_NAME, "input")
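
To make the syntax change above concrete, here is a minimal sketch of a helper that rewrites an old-style call into the new form. It is not part of the notebooks: the names LEGACY_TO_BY and migrate_line are hypothetical, and the approach (a simple regular expression applied to one line of source text) is an assumption that only covers straightforward calls. Migrated code also needs the import `from selenium.webdriver.common.by import By` for the `By` constants to resolve.

```python
import re

# Hypothetical lookup table: legacy helper suffix -> name of the matching By constant.
LEGACY_TO_BY = {
    "id": "ID",
    "name": "NAME",
    "xpath": "XPATH",
    "tag_name": "TAG_NAME",
    "class_name": "CLASS_NAME",
    "css_selector": "CSS_SELECTOR",
    "link_text": "LINK_TEXT",
    "partial_link_text": "PARTIAL_LINK_TEXT",
}

# Matches e.g. ".find_elements_by_tag_name(" or ".find_element_by_id(".
_LEGACY_CALL = re.compile(r"\.find_element(s?)_by_([a-z_]+)\(")


def migrate_line(line: str) -> str:
    """Rewrite legacy find_element(s)_by_* calls in one line of source text."""

    def _replace(match: re.Match) -> str:
        plural, suffix = match.group(1), match.group(2)
        by_name = LEGACY_TO_BY.get(suffix)
        if by_name is None:
            # Unknown helper name: leave the call untouched.
            return match.group(0)
        return f".find_element{plural}(By.{by_name}, "

    return _LEGACY_CALL.sub(_replace, line)


if __name__ == "__main__":
    # The example call from this README, before and after.
    print(migrate_line('driver.find_elements_by_tag_name("input")'))
```

A text rewrite like this is only a starting point. Review each migrated cell by hand, since calls split across several lines or built dynamically will not be matched.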

CURRENT VERSION (IMAGE EXTRACTION)

Please refer to ImageExtracting_Updated-DEC2022.ipynb, generated and tested on December 22nd, 2022.
This file includes the new Selenium syntax, fixes to scrolling issues, and a more efficient keyword search.

OLD VERSIONS

PLEASE NOTE: the notebooks below were not updated to the current Selenium syntax!!!

  • WebscrapingInstagram_completeNotebook: contains 90% automated code for extracting Instagram thumbnails.
    It worked well 2 years ago, but must now be adjusted to the new Selenium syntax.

  • WebscrapingInstagram_starterNotebook: contains the starter files for the Python Simplified tutorial on YouTube:
    https://youtu.be/iJGvYBH9mcY

  • ImageExtracting_bot: contains 100% automated code for extracting Instagram images,
    as well as ERROR FIXES and WIDER FUNCTIONALITY.
    It must be adjusted to the new Selenium syntax.

  • Commenting_bot: contains 100% automated code for commenting on all photos from a certain hashtag, presented live with We Are Growth Hackers:
    https://youtu.be/XnEgVZsZgco

Languages

Language: Jupyter Notebook 100.0%