datapao

Apache Spark, Big Data Infrastructure and AWS + Trainings

Home Page: http://datapao.com

datapao's repositories

dac

Databricks Admin Center

Language: Python | License: Apache-2.0 | Stargazers: 6 | Issues: 8 | Issues: 12

training-feed-kinesis

Manage Multiple Kinesis accounts from the same computer.

Language: Python | Stargazers: 3 | Issues: 1 | Issues: 0

bigdata-training

Datapao Big Data and Hadoop Training materials

Language: Jupyter Notebook | License: GPL-2.0 | Stargazers: 2 | Issues: 2 | Issues: 0

budapest-data-community

Budapest data community in numbers

Language: Jupyter Notebook | Stargazers: 2 | Issues: 2 | Issues: 0

serverless-logging

AWS Serverless logging

Language: Python | Stargazers: 1 | Issues: 3 | Issues: 0

streaming-format-benchmarks

Compare Go vs Python and JSON vs Avro for reading and writing streaming data.

Language: Go | License: MIT | Stargazers: 1 | Issues: 3 | Issues: 0

wilson

Six Sigma rules on PySpark DataFrames

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 4 | Issues: 0

conda-hands-on

Simplest Python package and Recipes for building and installing with Conda, Pip and Setuptools.

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 2 | Issues: 0

dask-for-ml

PoC for using Dask worker resources as a simple resource manager for deep learning training

Language: Python | Stargazers: 0 | Issues: 2 | Issues: 0

dlt-dev-with-dab

Demonstration of the development lifecycle with Databricks Delta Live Tables and Databricks Asset Bundles

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

jaffle_shop

A self-contained dbt project for testing purposes

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

spark

Mirror of Apache Spark

Language: Scala | License: Apache-2.0 | Stargazers: 0 | Issues: 2 | Issues: 0