WesGtoX / cialdnb

A command line application to extract data from websites.

CIALdnb

About  |   Overview  |   Technology  |   Getting Started  |   Roadmap  |   License

About the Project

CIALdnb is a command line application that takes a list of URLs and, for each page, extracts the logo, every phone number found on the page, and the URL of the visited website.
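
To make the description above concrete, here is a minimal sketch of that kind of extraction, using only the Python standard library. It is not the project's actual implementation: the `extract` function, the `LogoFinder` class, the `PHONE_RE` pattern, and the shape of the returned dictionary are all assumptions made for illustration, and the logo heuristic (first `<img>` with "logo" in any attribute) is deliberately naive.

```python
import re
from html.parser import HTMLParser
from urllib.parse import urljoin

# Hypothetical pattern: a loose match for phone-like runs of digits,
# spaces, parentheses, and dashes. Real pages need stricter validation.
PHONE_RE = re.compile(r"\+?\(?\d[\d\s()\-]{6,}\d")


class LogoFinder(HTMLParser):
    """Records the src of the first <img> whose attributes mention 'logo'."""

    def __init__(self):
        super().__init__()
        self.logo = None

    def handle_starttag(self, tag, attrs):
        if tag != "img" or self.logo is not None:
            return
        attr_map = {name: value or "" for name, value in attrs}
        if any("logo" in value.lower() for value in attr_map.values()):
            self.logo = attr_map.get("src") or None


def extract(url, html):
    """Return the logo URL, phone numbers, and website for one page (assumed shape)."""
    finder = LogoFinder()
    finder.feed(html)
    # Drop tags so that digits inside attributes are not mistaken for phones.
    text = re.sub(r"<[^>]+>", " ", html)
    phones = sorted({match.group().strip() for match in PHONE_RE.finditer(text)})
    # Resolve a relative logo path against the page URL.
    logo = urljoin(url, finder.logo) if finder.logo else None
    return {"logo": logo, "phones": phones, "website": url}
```

Calling `extract(url, html)` with a page URL and its already-downloaded HTML returns a plain dictionary, which could then be serialized however the caller prefers. Fetching the page is left out on purpose, so the sketch stays independent of any particular HTTP client.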

Overview

Technology

This project was developed with the following technologies:

Getting Started

Prerequisites

  1. Clone the repository:
git clone https://github.com/WesGtoX/cialdnb.git
cd cialdnb

Install and Run via Docker

  1. Build the Docker image:
docker build . -t run_cialdnb
  2. Run:
cat websites.txt | docker run -i run_cialdnb
  3. Alternatively, build and run via the Makefile:
make build
make run

Install and Run via Python

  1. Install the dependencies:
python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
  2. Run:
cat websites.txt | python -m run_cialdnb
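
Both run commands pipe `websites.txt` into the application's standard input. The sketch below shows one way such input could be consumed. It assumes the file holds one URL per line, which the README does not state; the `read_urls` helper is hypothetical, and skipping blank and duplicate lines is a design choice of this example rather than documented behavior.

```python
import sys


def read_urls(stream):
    """Yield stripped, non-empty lines from a text stream, skipping duplicates."""
    seen = set()
    for line in stream:
        url = line.strip()
        if url and url not in seen:
            seen.add(url)
            yield url


if __name__ == "__main__":
    # Echo each URL that would be processed, in input order.
    for url in read_urls(sys.stdin):
        print(url)
```

Reading lazily from the stream means the program can start work before the whole list has arrived, which suits a pipe such as `cat websites.txt | ...`.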

Roadmap

See the open issues for a list of proposed features (and known issues).

License

Distributed under the MIT License. See LICENSE for more information.


Made with ♥ by Wesley Mendes 👋
