Sadha Chilukoori's repositories
dbldatagen
Generate relevant data quickly for your projects. The Databricks data generator can be used to generate large simulated / synthetic data sets for test, POCs, and other uses
superset
Apache Superset is a Data Visualization and Data Exploration Platform
airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
pydata-book
Materials and IPython notebooks for "Python for Data Analysis" by Wes McKinney, published by O'Reilly Media
OpenMetadata
Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.
spark
Apache Spark - A unified analytics engine for large-scale data processing
mooc-certificates
course completion certificates
NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
Algorithms
A collection of algorithms and data structures
cheatsheets
Posit Cheat Sheets - Can also be found at https://posit.co/resources/cheatsheets/.
resume_tex
resume
Python_algos
All Algorithms implemented in Python
tech-interview-handbook
💯 Curated coding interview preparation materials for busy software engineers
scala-style-guide
Databricks Scala Coding Style Guide
maven-scala-seed.g8
A Giter8 template for a sample Scala project using the Maven build tool!
milewski-ctfp-pdf
Bartosz Milewski's 'Category Theory for Programmers' unofficial PDF and LaTeX source
gt-nlp-class
Course materials for Georgia Tech CS 4650 and 7650, "Natural Language"
dag-factory
Dynamically generate Apache Airflow DAGs from YAML configuration files
sqloxide
Python bindings for sqlparser-rs
creative-scala-template
Template for those following Creative Scala
py4fi
Python for Finance (O'Reilly)
JuliaAcademyMaterials
Assets and Infrastructure for JuliaAcademy.com
ScalaCookbook2Examples
Source code examples for the Second Edition of the Scala Cookbook
ge_tutorials
Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.