Garren Staubli (gstaubli)

gstaubli

Geek Repo

Company:@databricks

Location:Greater Seattle Area, WA

Home Page:http://www.garrens.com/blog

Github PK Tool:Github PK Tool

Garren Staubli's repositories

csv2avro

Command line script to convert CSV/TSV files to AVRO

Language:PythonLicense:MITStargazers:6Issues:2Issues:1
Language:PythonStargazers:1Issues:0Issues:0

increments

A gem to facilitate incrementing values

Language:RubyLicense:MITStargazers:1Issues:2Issues:1

meta_func

Python decorator function to track metadata on function calls

Language:PythonLicense:MITStargazers:1Issues:2Issues:0

runtime_stats

Python decorator function to track runtime stats on function calls

Language:PythonLicense:MITStargazers:1Issues:2Issues:0

AzureDatabricksBestPractices

Version 1 of Technical Best Practices of Azure Databricks based on real world Customer and Technical SME inputs

License:CC-BY-4.0Stargazers:0Issues:1Issues:0

hive_metadata_utils

Find Hive Tables by Table or Column Names

Language:ScalaStargazers:0Issues:2Issues:0

pndb

Pseudo-Normalized Database Engine Proof of Concept

Language:PythonLicense:MITStargazers:0Issues:2Issues:0
License:NOASSERTIONStargazers:0Issues:0Issues:0

pyspark-intro

Intro to PySpark codebase

Language:Jupyter NotebookLicense:MITStargazers:0Issues:2Issues:0

pyspark-nlp

Using PySpark with Natural Language Processing (NLP) and Machine Learning (ML)

Language:Jupyter NotebookLicense:MITStargazers:0Issues:2Issues:0

split_file_by_key

Given a *SORTED* file, delimiter, and key(s), split the file into numerous out files based on the key(s).

Language:PythonLicense:MITStargazers:0Issues:2Issues:0
Language:PythonStargazers:0Issues:0Issues:0

Workshops

Training Workshop Code & Materials

Language:Jupyter NotebookStargazers:0Issues:2Issues:0