agoujet / drill-twitter

Repository of using Apache Drill and Flume for twitter

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

drill-twitter

Repository of using Apache Drill and Flume for twitter

This repository includes 2 parts. The first is a Flume package for the MapR Hadoop distribution with a Twitter source and filtering for Twitter data based on keywords and languages, and MapR-FS as sink. The data is stored in native JSON format. The second part is Drill queries for the Twitter JSON data, that can be used to directly query the data as well as to create views of the data. These are meant as templates to be expanded on.

About

Repository of using Apache Drill and Flume for twitter


Languages

Language:Java 82.4%Language:Shell 17.6%