galanteh / GetKavakMexico

This a NiFi processor to run a simple web scrapping to get all the published cars with all the features of the publication so can you can store it and analyze it in a database

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

GetKavakMexico

NiFi processor to run a web scrapping on Kavak Mexico and get all the cars published

How it works?

This processor runs a web scrappy over Kavak Mexico and returns all the cars as flowfiles with all the information as Attributes.

Processor working

How it works

Processor

Running

Queue

Queue

Attributes

Attributes

Attributes2

Disclaimer

Please, always check the policy of the Ka.vak Mexico which you are trying to scrape information, this is only for educational purposes to develop on NiFi. If you plan to use it just don't run this software continuously. Avoid harm the website service.

Usage

The material embodied in this software is provided to you "as-is" and without warranty of any kind, express, implied or otherwise, including without limitation, any warranty of fitness for a particular purpose.

Why the methods to obfuscate the website URL?

We published on this site the name of the brand altered with points and dashes or others signs due to the harassment of the brand lawyers who seems not understand the motivations of the open source projects.

About

This a NiFi processor to run a simple web scrapping to get all the published cars with all the features of the publication so can you can store it and analyze it in a database

License:Apache License 2.0


Languages

Language:Java 100.0%