There are 4 repositories under data-modelling topic.
Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.
dbt + Metabase integration
BENERATOR is a leading software solution to generate, obfuscate, pseudonymize and migrate data for development, testing, and training purposes with a model-driven approach.
Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow
Hypergol is a Data Science/Machine Learning productivity toolkit to accelerate any projects into production with autogenerated code, standardised structure for data and ML and parallel processing out-of-the-box.
Cool DE Projects
This repository is a working ETL framework which utilizes user data from Spotify API using ➲Python for Extraction and Transformation ➲SQL for Data Loading and Staging ➲Airflow for Data Orchestration and Monitoring ➲PowerBI for Reporting
🎥 Email marketing campaign analysis
COVID-19 Surveillance Data Modelling and Management Pipeline in Piedmont.
This repo covers the processes of designing a database by performing logical, conceptual and physical data modelling processes, creating the designed database using DML and DDL on various database server systems and performing SQL queries on the created database.
Repository with files that I worked upon during the DBS211 (Introduction to Database Systems) course.
Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.
Data model for the Participatory Knowledge Practices in Analogue and Digital Image Archives (PIA) project
A Udacity Power BI project on an online clothing store
Data Visualization for Atliq Hardware sales
This repo contains HR Analytics project to analyze what factors impact employee attrition using dataset for Atlas Labs Company.
Formula 1 race data engineering project which utilises azure services and databricks to ingest and analyse the data.
This Repository consist of all the Jupyter Notebooks, Images and .CSV files of the tasks that were assigned during the Cognizant Artificial Intelligence Course hosted on Forage
Employee data analysis using sql queries and table schemata
This project accompanies a Power BI data visualization project focused on analyzing various aspects of a telecom company's operations. It provides guidance and resources for creating three distinct dashboards:
To help an organisation to improve the employee performance and to improve employee retention (reduce Attrition) by creating a HR Analytics Dashboard Using Power BI.
Artificial Intelligence Virtual Experience Program
A Common Lisp model for modelling models: entity, association, generalization, subject, type, etc.
Sales Insight Project for Atliq Hardware using Power BI and SQL
repo for BDBT course project in Winter '21 semester
⚙️ ETL pipeline on AWS using S3 and Redshift
Built employee database using SQL by applying data modeling, engineering and analysis skills. I prepared an entity relationship diagram (ERD). I used PostgreSQL and pgAdmin to write SQL scripts to build tables, added joins between tables, imported data from CSV files, and wrote custom reports.
Modeling the JSON data using Postgres
This project carried out as the final capstone project of the Udacity Data Engineering nanodegree program. It involves Extracting, Loading, and Transforming of datasets of different file formats from the web (downloadable,), to the lake (S3), and then the warehouse (Redshift)
Designing MongoDB database for Zen class programme, covering users, codekata, attendance, topics, tasks, company drives, and mentors as part of a Day-36 task from GUVI Zen class.