Sabyasachi Dasgupta's repositories
ApacheSpark
This repository helps you learn Databricks concepts through examples. It covers the important topics a data engineer needs in real-world work.
ARIMA
Simple Python example of how to use ARIMA models to analyze and predict time series.
azure-docs
Open source documentation of Microsoft Azure
AzureDatabricksBestPractices
Version 1 of Technical Best Practices of Azure Databricks based on real world Customer and Technical SME inputs
BigData-HW-Spark
Spark exercises: Spark RDD, SparkSQL, Spark ML pipelines, Spark in the cloud (AWS)
CanadaPubSecALZ
This reference implementation is based on Cloud Adoption Framework for Azure and provides an opinionated implementation that enables ITSG-33 regulatory compliance by using NIST SP 800-53 Rev. 4 and Canada Federal PBMM Regulatory Compliance Policy Sets.
Clustering1D-Data
Clustering-Exercise-1D-Data
Databricks-Apache-Spark-2X-Certified-Developer
Databricks - Apache Spark™ - 2X Certified Developer
DatabricksContent
Examples surrounding Databricks.
db-delta
Enablement and examples as they relate to Delta Lake for both the Open Source and Databricks Implementation.
MachineLearningNotebooks
Python notebooks with ML and deep learning examples with Azure Machine Learning | Microsoft
mlflow
MLflow-learning
mlops-queens-university
Big Data and MLOps content presented at Queen's University MMAI Workshop
MS-DA202.1
MS-DA202.1 | Data Cleaning with Python
Product-Recommendation-for-Online-Grocery-WebAPP
Agile software development for a WebApp that prompts users to search, purchase products, and then recommends two items frequently bought together.
sabyadg.io
Personal webspace
spaCy
💫 Industrial-strength Natural Language Processing (NLP) with Python and Cython
spark-training
Repository used for Spark Trainings
sparkprophet
Sample application running fbprophet using Spark
time_series_models
Exploring-time-series-data