darideveloper / pinterest-post-bot


Europeanstartups Scraper

A Python scraper that extracts data from the page https://app.europeanstartups.co/companies.startups/f/data_type/anyof_Verified/regions/allof_European%20Union, using a Google Chrome user data folder in which a premium account is already logged in.
Report Bug · Request Feature

Table of Contents
  1. About The Project
  2. Getting Started
  3. Usage
  4. Roadmap
  5. Contributing
  6. License
  7. Contact

About The Project

Web page Screenshot

The project extracts all results from the page https://app.europeanstartups.co/companies.startups/f/data_type/anyof_Verified/regions/allof_European%20Union and saves the output data in a CSV file.

The project is a Python script that reuses a Google Chrome user data folder, in which a premium account is already logged in, to extract the data from the page.

Built With

  • Python
  • Selenium

Getting Started

To get a local copy up and running follow these simple example steps.

Prerequisites

Installation

  1. Clone the repo
    git clone https://github.com/darideveloper/europeanstartups_scraper.git
  2. Install the Python packages (open a terminal in the project folder)
    python -m pip install -r requirements.txt
  3. Create a .env file in the project folder.
  4. Set your Chrome path in the .env file (note: the Chrome path is the folder where Chrome stores its user data)
    CHROME_PATH = C:\Users\<<your-user-name>>\AppData\Local\Google\Chrome\User Data
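
As a rough illustration of how the CHROME_PATH setting can be used, the sketch below reads the value from the .env file and passes it to Chrome through Selenium. The helper names (`parse_env`, `chrome_arguments`, `start_chrome`) are hypothetical and are not taken from this repository, and the project itself may load the file differently. `--user-data-dir` is the Chrome switch that selects an existing user data folder.

```python
from pathlib import Path


def parse_env(text: str) -> dict:
    """Parse `KEY = value` lines, ignoring blank lines and # comments."""
    values = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        # Split on the first "=" only, so values may contain "=" themselves.
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def chrome_arguments(chrome_path: str) -> list:
    """Build the Chrome flag that points at an existing user data folder."""
    return [f"--user-data-dir={chrome_path}"]


def start_chrome(env_file: str = ".env"):
    """Start Chrome through Selenium with the profile named in the .env file."""
    # Imported here so the helpers above work without Selenium installed.
    from selenium import webdriver

    settings = parse_env(Path(env_file).read_text(encoding="utf-8"))
    options = webdriver.ChromeOptions()
    for argument in chrome_arguments(settings["CHROME_PATH"]):
        options.add_argument(argument)
    return webdriver.Chrome(options=options)
```

Chrome normally refuses to share a user data folder between two running instances, which is why all Chrome windows should be closed before a function like `start_chrome()` is called.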

Usage

  1. Go to https://app.europeanstartups.co/companies.startups/f/data_type/anyof_Verified/regions/allof_European%20Union and create an account (if you have problems with your email, try a Proton email address)
  2. Activate the premium trial or buy a premium account
  3. Make sure the account stays logged in in the browser.
  4. Open a terminal in the project folder
  5. Run the project folder with Python:
    python .
  6. Wait until the script finishes, then check the output.csv file in the project folder (note: while the script is running, you can't use Google Chrome).
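
To check the result of a run quickly, a short script such as the one below can report the columns and the number of rows in output.csv. This is only a sketch: the function name `summarize_csv` is hypothetical, and it assumes the file is UTF-8 encoded with a header row, which this README does not confirm.

```python
import csv


def summarize_csv(path: str = "output.csv") -> tuple:
    """Return (column names, number of data rows) for a CSV file."""
    with open(path, newline="", encoding="utf-8") as handle:
        reader = csv.DictReader(handle)
        # Iterating the reader consumes the header first, then the data rows.
        row_count = sum(1 for _ in reader)
        return reader.fieldnames or [], row_count
```

Calling `summarize_csv()` from the project folder after the script finishes returns the header names and the row count, which is enough to confirm that the scrape produced data.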

Roadmap

  • Use Chrome data to avoid logging in to the page
  • Extract all data from the page
  • Save output data in csv file
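
The last item can be sketched with Python's standard csv module. The function name `save_rows` and the example field names are assumptions for illustration only. The actual columns written by this project are not listed in this README.

```python
import csv


def save_rows(rows: list, path: str = "output.csv") -> int:
    """Write a list of dicts to a CSV file and return the number of rows written."""
    if not rows:
        return 0
    # Use the keys of the first row as the header (assumes uniform rows).
    fieldnames = list(rows[0].keys())
    with open(path, "w", newline="", encoding="utf-8") as handle:
        writer = csv.DictWriter(handle, fieldnames=fieldnames)
        writer.writeheader()
        writer.writerows(rows)
    return len(rows)
```

For example, `save_rows([{"name": "Acme", "country": "DE"}])` writes a header line followed by one data line. Opening the file with `newline=""` lets the csv module control line endings, which avoids blank lines between rows on Windows.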

See the open issues for a full list of proposed features (and known issues).

Contributing

Contributions are what make the open source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.

If you have a suggestion that would make this better, please fork the repo and create a pull request. You can also simply open an issue with the tag "enhancement". Don't forget to give the project a star! Thanks again!

  1. Fork the Project
  2. Create your Feature Branch (git checkout -b feature/AmazingFeature)
  3. Commit your Changes (git commit -m 'Add some AmazingFeature')
  4. Push to the Branch (git push origin feature/AmazingFeature)
  5. Open a Pull Request

License

Distributed under the MIT License. See LICENSE for more information.

Contact

Darideveloper - @developerdari - darideveloper@gmail.com

Project Link: https://github.com/darideveloper/europeanstartups_scraper

Last code update: 2022-11-29
