Apfirebolt / advanced-gui-based-web-scrappers-in-pyqt

A collection of advanced GUI I written for personal use in Pyside6/Pyqt6

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

A Collection of web-scrappers with GUI written in Pyside6/PyQt6

This would have multiple GUI programs to scrap pages using PySide6 and BeautifulSoup.

Description

GUI based advanced web-scrappers. Scrapping can be applied to many use cases for instance, downloading images from a given web-page.

List of Programs

1. GUI Based Scrapper created in Python using PySide6

This you can find in the file named 'gui_pyqt6_scrapper.py'. You can scrap a given page using multiple web-agents. It is important to keep switching agents to avoid getting blocked by websites under suspicion. This scrapper allows you to search for given tags in the page for example, you can search for all occurrences of h1 tag, p tag and so on. Additionally, you can download all the images from a scrapped page in a given folder which you get to choose.

Getting Started

  • Create a Virtual Environment using Python
  • Install packages in requirements.txt

That's it, ready to run the program. Pyside6/PyQt6 are bulky packages which can consume upto 1 GB on your system but they're like very feature rich and advanced compared to Tkinter.

Dependencies

  • Any operating system with Python 3 installed should work.
  • Pyside6/PyQt6
  • BeautifulSoup
  • Requests
  • Generic familiarity with web scrapping

Executing program

  • Simply run the file once virtual environment is activated and relevant packages are installed.
python gui_pyqt6_scrapper.py

Authors

Contributors names and contact info

ex. Apfirebolt - @amitprafull

License

This project is licensed under the MIT License - see the LICENSE.md file for details

About

A collection of advanced GUI I written for personal use in Pyside6/Pyqt6

License:MIT License


Languages

Language:Python 100.0%