Garren Staubli's repositories
increments
A gem to facilitate incrementing values
runtime_stats
Python decorator function to track runtime stats on function calls
AzureDatabricksBestPractices
Version 1 of Technical Best Practices of Azure Databricks based on real world Customer and Technical SME inputs
hive_metadata_utils
Find Hive Tables by Table or Column Names
pyspark-intro
Intro to PySpark codebase
pyspark-nlp
Using PySpark with Natural Language Processing (NLP) and Machine Learning (ML)
split_file_by_key
Given a *SORTED* file, delimiter, and key(s), split the file into numerous out files based on the key(s).