ZubairImtiaz3 / get-remote-companies

A Node.js script that scrapes remote job postings from a specific job search page to find companies hiring from Pakistan.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Get Remote Companies

NodeJS TypeScript

A Node.js script that scrapes remote job postings from a specific job search page to find companies hiring from Pakistan. The script uses Puppeteer for web scraping and stores the data in a JSON file.

Note:

The companies.json file already contains names of companies that have been scraped. This file will be updated as new companies are found during subsequent runs of the script.

Features

  • Scrapes Job Postings: Extracts job data from the provided job search page.
  • Filters by Location: Only includes companies hiring remotely from Pakistan.
  • Detailed Information: Retrieves additional company details including description and number of employees.
  • Data Storage: Saves unique company names in a companies.json file.
  • Technologies

    • Node.Js
    • TypeScript
    • Puppeteer

    Installation & Run Locally

    1. Clone the repository:

      git clone https://github.com/ZubairImtiaz3/get-remote-companies.git
      cd get-remote-companies
    2. Install dependencies:

      npm install
    3. Run the script:

      npm run start

      The script will start scraping job postings and will save unique company names in companies.json.

    How it can be improved

  • Custom Location Input: Allow users to enter any country name and have the script scrape job postings and companies accordingly.
  • Error Handling: Enhance error handling for network issues or page structure changes..
  • Contributing

    Feel free to submit issues or pull requests. Contributions are welcome!

    About

    A Node.js script that scrapes remote job postings from a specific job search page to find companies hiring from Pakistan.


    Languages

    Language:TypeScript 100.0%