roscopecoltran / scraper

A real-time configuration based HTML/XML/RSS/JSON/CSV to JSON API server [WIP]

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Sniperkit-Scraper - Docker stack

[To do]

Intro

[WIP]

Features

Goals

Quick Start

go get -v github.com/roscopecoltran/scraper
cd $GOPATH/src/github.com/roscopecoltran/scraper
go run *.go ./providers.dev.yaml

Make

(DEV) Scraper
make build
make run
(DIST) Scraper
make dist

Crane

go get -v -u github.com/michaelsauter/crane
(DIST) Scraper
crane up dist
(DEV) Scraper
crane up dev

Docker-Compose

MacOSX:

brew install docker
brew install docker-compose

(DIST) Scraper + ETCD3 / E3CH

Bootsrap:

docker-compose build --no-cache scraper
docker-compose up scraper

Examples:

open http://localhost:3000/bing?query=dlib (bing search endpoint)
open http://localhost:3000/admin (scraper admin)

(DEV) Scraper + ETCD3 / E3CH

Bootsrap:

docker-compose build --no-cache scraper_dev
docker-compose up scraper_dev

(DEV) Scraper + ETCD3 / E3CH + ELK

Bootsrap:

docker-compose build --no-cache scraper_elk
docker-compose up scraper_elk

Examples:

open http://localhost:8086/ (e3ch)
open http://localhost:5601/ (kibana v5.x)

ETCD3 / E3CH

Bootsrap:

docker-compose build --no-cache e3w_dev
docker-compose up e3w_dev

Examples:

open http://localhost:8086/ (e3ch)

go run *.go --debug --verbose ./providers.dev.json

ip="ifconfig en0 | grep inet | awk '$1=="inet" {print $2}'" socat TCP-LISTEN:6000,reuseaddr,fork UNIX-CLIENT:"$DISPLAY" eg. docker run -e DISPLAY=192.168.0.2:0 gns3/xeyes https://stackoverflow.com/questions/37826094/xt-error-cant-open-display-if-using-default-display

socat TCP-LISTEN:6000,reuseaddr,fork UNIX-CLIENT:"$DISPLAY" docker run -e DISPLAY=192.168.0.2:0 jess/geary

RabbitMQ (brew/osx)

Management Plugin enabled by default at http://localhost:15672

Bash completion has been installed to: /usr/local/etc/bash_completion.d

To have launchd start rabbitmq now and restart at login: brew services start rabbitmq Or, if you don't want/need a background service you can just run: rabbitmq-server

NSQ

To have launchd start nsq now and restart at login: brew services start nsq Or, if you don't want/need a background service you can just run: nsqd -data-path=/usr/local/var/nsq

About

A real-time configuration based HTML/XML/RSS/JSON/CSV to JSON API server [WIP]

License:MIT License


Languages

Language:JavaScript 29.3%Language:HTML 17.9%Language:Python 17.6%Language:C++ 15.2%Language:Go 7.4%Language:CSS 5.4%Language:Shell 4.3%Language:CoffeeScript 1.6%Language:CMake 0.6%Language:C 0.3%Language:Makefile 0.2%Language:Batchfile 0.1%Language:M4 0.1%Language:Ruby 0.0%Language:Roff 0.0%