Shin So's repositories
ambari
Mirror of Apache Ambari
ambari-bootstrap
Collection of tools for bootstrapping Apache Ambari & deploying clusters
awesome-nifi
A list of useful Apache NiFi resources, processor bundles and tools
cdf-workshop
Cloudera CDP/CDF Workshop
cdp-workshop
Cloudera CDP Workshop
cml-demo
Cloudera CML Demo
cml-training
Example Python and R code for Cloudera Machine Learning (CML) training
CML_AMP_Churn_Prediction
Build an scikit-learn model to predict churn using customer telco data.
COPML-AMP-1-telco-churn
Example project 1 for the Cloudera COPML whitepaper
freeipa
Mirror of FreeIPA, an integrated security information management solution
HDP-HDF-workshop
Leveraging Hortonworks' HDP 3.0 and HDF 3.2 components, this tutorial guides the user through steps to stream data from a REST API into a live dashboard using NiFi, Kafka, Hive LLAP with Druid integration and Superset
hive_tuning
hive
multiple-dimension-spread
Multiple-Dimension-Spread (MDS) is a Schema-less columnar storage format. Provide flexible representation like JSON and efficient reading similar to other columnar storage formats.
nifi-tensorflow-processor
Example Tensorflow Processor using Java API for Apache NiFi 1.2+
nyc-taxi-data
Import public NYC taxi and Uber trip data into PostgreSQL / PostGIS database, analyze with R
presto
Teradata Distribution of Presto -- A Distributed SQL Query Engine for Big Data
redash
Make Your Company Data Driven. Connect to any data source, easily visualize and share your data.
slides-articles-2017fall
JJUG CCC 2017 Fallの発表資料およびブログ記事まとめ
yosegi
Yosegi is a Schema-less columnar storage format. Provide flexible representation like JSON and efficient reading similar to other columnar storage formats.