Kulraj Singh Kohli's repositories
Exercise-Tracker
MERN Stack
Hospital-Database-Management-System
The purpose of this database is to maintain the data used to support an active hospital management system including creating, tracking, searching and reporting the data included in the database. The database includes data/ information about the patients – incoming and outgoing, their diagnosis, test results, room allocated, their appointment booking, payment information and insurance information. It also contains data on the doctor and medical staff in the hospital. Furthermore, it will also include data on the hospital inventory status. It will be used by administrative staff of the hospital to keep track of the doctor, medical staff and patient information
Python-Data-Pipeline-for-Snowflake-
Generalized code for doing pulling and pushing data to snowflake using python
Apache-Airflow
Setting up and doing projects with apache airflow
Apache-Nifi
Setting Up and Pipelines using Nifi
Elastic-Search
Setting up Elastic Search and performing data pipelining and transformation.
Data-Engineering-with-Python
Repository for Beginner codes on how to read and write files
PostgreSQL
Installing and configuring PostgreSQL & pgAdmin 4
airflow-2.4-cloud-composer-1
POC repo for working of airflow using google cloud composer
Collecting-and-Merging-Data
Joining,Extracting and Inspecting the Police Beats Dataset
dagster-multidocker-deployment
A sample dagster deployment using a multi-docker set up
Data-Manipulation-Using-R
Using tidyverse to explore, formulate questions and visualize the NYC Flights Data
DatabaseMigration
MySQL to Postgres Migration using Python Pandas Pipeline
DBT-Training
DBT Training
Exploratory-Analysis-of-Data
Formulation of Questions, Exploratory Analysis and Basic Visualization of NYC Flights data on R
hello-world
Just my first repository
Infowarriors-Davidson
Text Mining using Robotic Process Automation & NLP
Jobs-Scrape-Selenium
Indeed website scarping job using selenium
MachineLearningNotebooks
Python notebooks with ML and deep learning examples with Azure Machine Learning | Microsoft
Pipelines-to-Cloud-Services
Building Pipelines with cloud services
Probability-and-Distribution-Using-R
R Markdown HTML file with Bayes Theorem Normal Distribution and Mean Median and Mode
Probability-and-Distributions-using-R
You are hired by Air Nowhere to recommend the optimal overbooking rate. It is a small airline that uses a 100-seat plane to carry you from Seattle to, well, nowhere. The tickets cost $100 each, so a fully booked plane generates $10,000 revenue. The sales team has found that the probability, that the passengers who have paid their fare actually show up is 98%, and individual show-ups can be considered independent. The additional costs, associated with finding an alternative solutions for passengers who are refused boarding are $500 per person.
PySpark-Functions
Built In Functions | User Defined Functions | Joins | Dates
PySpark-Importing-Data
Download and install Spark | Setup environment | Downloading and preprocessing Chicago's Reported Crime Data | Schemas
PySpark-RDD-s
Working with RDD's
SQL-Window-CTE-CASE
Example of Window functions, CTEs and CASE Functions using Adventure Works
Windows-Subsytem-for-Linux-Tips-and-Tricks
Things to get around in the Linux environment on Windows