Dennis Huo (dennishuo)

dennishuo

Geek Repo

Company:Snowflake

Github PK Tool:Github PK Tool

Dennis Huo's repositories

Language:ShellLicense:Apache-2.0Stargazers:1Issues:0Issues:0

dataproc-initialization-actions

Run in all nodes of your cluster before the cluster starts - let's you customize your cluster

Language:ShellLicense:Apache-2.0Stargazers:1Issues:1Issues:0

spark-dataflow

Provides a Spark backend for executing Dataflow pipelines.

Language:JavaLicense:Apache-2.0Stargazers:1Issues:0Issues:0

airflow-gcp-examples

Repository with examples and smoke tests for the GCP Airflow operators and hooks

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

appengine-flask-skeleton

A skeleton for creating Python applications using the Flask framework on App Engine

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

arrow

Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

bigdata-interop

Libraries and tools for interoperability between Hadoop-related open-source software and Google Cloud Platform.

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

bigtop

Mirror of Apache Bigtop

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

cloud-bigtable-examples

Examples of how to use Cloud Bigtable both with GCE map/reduce as well as stand alone applications.

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

codelabs

Codelabs in various languages demonstrating usage of several tools & systems upon genomics data.

License:Apache-2.0Stargazers:0Issues:0Issues:0

hadoop

Mirror of Apache Hadoop

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

hbase

Mirror of Apache HBase

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

hive

Mirror of Apache Hive

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

kaggle-dsb2

Kaggle 2nd annual data science bowl

Language:JavaStargazers:0Issues:0Issues:0

luigi

Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

parquet-format

Apache Parquet

License:Apache-2.0Stargazers:0Issues:0Issues:0

spark

Mirror of Apache Spark

Language:ScalaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

spark-csv

CSV data source for Spark SQL and DataFrames

Language:ScalaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

zeppelin

Mirror of Apache Zeppelin

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

zlib

A massively spiffy yet delicately unobtrusive compression library.

Language:CStargazers:0Issues:0Issues:0