Mallik-G's repositories
Airbnb-ETL-pipeline-Spark-on-EMR-Redshift-Airflow
A project to learn a bit of Spark, Airflow and Redshift
Airbnb-ETL-pipeline-Spark-Scala-in-Databricks-Airflow
A project to learn Spark with Scala, Databricks & Airflow
azure-blob-storage-fastapi
Azure Blob Storage con FastAPI
cloud-guardrails
Rapidly apply hundreds of security controls in Azure
data-engineering
Getting Started with Data Enngineering
databricks-eventhub-demo
Structured Streaming demo with Event Hub
databricks-lakehouse
A 1 hour workshop running through the data lakehouse and deep dive into delta lake
databricks-platform-administration-framework
Framework for end to end databricks platform administration using YAML. Includes cluster management, ACL permissions, and wrapper for SCIM & Permissions REST API.
dataload
Example code demonstrating how to use the Janrain /entity.bulkCreate API to bulk load user profiles into the Janrain platform.
de-apache-spark
Data Engineering com Apache Spark
delta-lake-pipeline
Delta lake pipelines in Databricks lakehouse with Spark structured streaming and batching jobs
diagrams
:art: Diagram as Code for prototyping cloud system architectures
FastAPI-course-content
Content and resources for the VideoLab FastAPI course.
ML-For-Beginners
12 weeks, 24 lessons, classic Machine Learning for all
overwatch
Capture deep metrics on one or all assets within a Databricks workspace
Pyspark_Questions_SKS
This repo is mostly created for pyspark and hive related interview questions.
SlowlyChangingDimensionsInDeltaLake
Slowly Changing Dimensions In Databricks Delta Lake
synapse-cicd
A GitHub Actions/Azure DevOps implementation of a CI/CD workflow for Azure Synapse or Azure SQL Database
TeradataExportScripts
Some sample scripts that are very helpful to extract the DDLs from your Teradata database
trainingdays
Azure Developer College's application development training days content.