Dreffed / haystack

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Haystack Web Collector Suite

This suite is undergoing conversion to Python 3+

Haystack is a tool used to harvest data from a variety of web sites and will push the data to an underlying database (Maria DB) This is an initial working program started in 2009, a newer version called Peregrin is under development.

Getting Started

To use Haystack

  • create a db schema (schema/peregrin.mwb)
  • Each Module can be run as a standalone instance, see self runner code in wach module

Prerequisites

Mariadb or MySQL installed and a database called Peregrin setup Imstall the following modules:

  • pip install configparser
  • pip install mysqlclient

_ some modules use other libraries, please review includes imports in each module _

Authors

  • David Gloyn-Cox - Initial work - Dreffed

License

This project is licensed under the MIT License - see the LICENSE.md file for details

Acknowledgments

  • This program was written to scratch an itch I had while looking for gainful employment

About


Languages

Language:Python 100.0%