puksh / olxScrapper

OLXScrapper

OLXScrapper is a simple web scraper for searching the OLX website. Given a query, it fetches the matching search results from OLX and extracts the relevant information, providing a basic foundation for a better search engine.

Features

  • Scrape search results from OLX
  • Uses Cheerio for HTML parsing
  • Utilizes Axios for HTTP requests
  • Manages concurrent requests with p-queue
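
To illustrate what "managing concurrent requests" means in practice, here is a minimal, dependency-free sketch of a concurrency limiter. It is not p-queue's API and not necessarily how this project wires things up; `runWithLimit` and `fakeFetch` are hypothetical names, and the simulated request stands in for a real HTTP call to OLX.

```javascript
// Dependency-free stand-in for a concurrency-limited queue.
// NOT p-queue's API; runWithLimit and fakeFetch are hypothetical names.
async function runWithLimit(tasks, limit) {
  const results = new Array(tasks.length);
  let next = 0; // index of the next task that has not been started yet

  // Each worker keeps pulling the next unstarted task until none remain.
  async function worker() {
    while (next < tasks.length) {
      const index = next++;
      results[index] = await tasks[index]();
    }
  }

  // Start at most `limit` workers; each worker runs one task at a time,
  // so no more than `limit` tasks are ever in flight together.
  const workerCount = Math.min(limit, tasks.length);
  await Promise.all(Array.from({ length: workerCount }, worker));
  return results;
}

// Simulated request: resolves after a short delay instead of contacting OLX.
const fakeFetch = (page) =>
  new Promise((resolve) => setTimeout(() => resolve(`page ${page}`), 10));

// Fetch four simulated pages, two at a time.
runWithLimit([1, 2, 3, 4].map((p) => () => fakeFetch(p)), 2)
  .then((pages) => console.log(pages));
```

Results are returned in the same order as the input tasks, regardless of which task finishes first. Capping the number of in-flight requests like this is a common way to avoid overloading the site being scraped.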

Getting Started

Prerequisites

Ensure you have the following installed on your machine:

  • Node.js (v14.x or higher)
  • npm (comes with Node.js)

Installation

  1. Clone the repository:

    git clone https://github.com/puksh/olxscrapper.git
  2. Navigate into the project directory:

    cd olxscrapper
  3. Install the dependencies:

    npm install

Running the Server

Once the dependencies are installed, you can start the server:

    npm start

This will run the server on your local machine. You can then send queries to the server to scrape results from OLX.

Usage

Type any query into the search bar on the website to search OLX.

Dependencies

  • axios: HTTP client for making requests.
  • cheerio: Fast, flexible, and lean implementation of core jQuery for parsing HTML.
  • express: Web framework for Node.js.
  • p-queue: Promise queue with concurrency control.

Author

  • Pukash - Creator and maintainer.

License

No license has been formally chosen yet; it will probably be MIT.
