abourakba / kylo-demo

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

kylo-demo

Introduction

Stock prediction based on news articles and tweets.

Work is based on the work of Eliano Marques

Data ingestion

Companies

The companies directory is composed of:

Stocks

Concerning the stock, you should import the template in Kylo and then create a feed. The directory contains the following files

In case you would like to recreate a file you should use the following request :

https://www.google.com/finance/getprices?q=MMM&i=86400&p=40Y&f=d,c,v,k,o,h,l&df=cpct&auto=0&ei=Ef6XUYDfCqSTiAKEMg

Note that if you want the last quote from the day you should use the following request :

https://www.google.com/finance/getprices?q=GOOG&i=86400&p=2d&f=d,c,v,k,o,h,l&df=cpct&auto=0&ei=Ef6XUYDfCqSTiAKEMg

Some documentation on google financial api may be found on the unofficial documentation.

Twitter

This directory all the stuff related to twitter ingestion :

  • A template to ingest twitter news.
  • A feed to ingest twitter news related to google.

WARNING

Concerning the GetTwitter processor, make sure the clock on the sandbox and the host are synchronized.


yum install -y ntp
service ntpd stop

//At home use 
ntpdate pool.ntp.org

//At teradata office use
ntpdate -vd time01.teradata.com

You may also set these to be automatic:

vi /etc/ntp/step-tickers
//write in the file
time01.teradata.com

// setup the service to run automatically
service ntpd start

News

NewsAPI

Nifi template for metadata ingestion for news articles is provided with NewsAPI.

NewsAPI provides over 70+ sources, for which we query one by one.

Directory composition:

  • Template Template-NewsAPI-Ingest.xml to query for sources and articles. Input is generated automatically to query NewsAPI about current article sources, then it queries for articles on each source. Output is FlowFiles as an article metadata.

Yahoo News

Service no longer available

Financial Times

Service is not free

Visualization using Kibana

Elastic is already installed in Kylo (port 9200 and port 9300).

If you want to have a description of your index:

curl -XGET 'localhost:9200/_cat/indices?v&pretty'

To setup Kibana you should follow this tutorial, main steps are :

  • Download and install Kibana as a service.
  • use Nginx as a reverse proxy.

You could use the port 8989 for kibana.

In the configuration file, instead of using example.com you may use sandbox.kylio.io.

Documentation

In order to update this document you may use this markdown-cheatsheet.

About