Rahul Gupta's starred repositories
introduction-to-bash-scripting
Free Introduction to Bash Scripting eBook
Spark-The-Definitive-Guide
Spark: The Definitive Guide's Code Repository
data-science-interview-questions-and-answers
Data science interview questions with answers. Not ideally (yet)
JuliaTutorials
Learn Julia via interactive tutorials!
pyspark-tutorial
PySpark-Tutorial provides basic algorithms using PySpark
pyspark-examples
Pyspark RDD, DataFrame and Dataset Examples in Python language
learning-apache-spark
Notes on Apache Spark (pyspark)
sql-spark-connector
Apache Spark Connector for SQL Server and Azure SQL
feature-engineering-tutorials
Data Science Feature Engineering and Selection Tutorials
Machinehack-Power-Plant-Energy-Output-Prediction
The dataset was collected from a Combined Cycle Power Plant over 6 years (2006-2011) when the power plant was set to work with a full load. Features consist of hourly average ambient variables Temperature (T), Ambient Pressure (AP), Relative Humidity (RH), and Exhaust Vacuum (V) to predict the net hourly electrical energy output (PE) of the plant.
Bank-Promotional-Compaigns-Strategy-Classification
To Classify whether the customer subscribed a term deposit or not
hci-papers-summaries
Summaries of papers related to human computer interfaces, especially in relation to machine learning methods.
Web-scrapping-Of-zomato-Website
Scrapping of data from the zomato site and saving it in excel sheet.
MachineHack-metalfurnace-classification
Metal Furnace Challenge : Weekend Hackathon #1. Given are 28 distinguishing factors in the manufacturing of an alloy, your objective as a data scientist is to build a Machine Learning model that can predict the grade of the product using these factors.
COVID-19-data-visualization-and-time-series-analysis-with-animated-charts-using-plotly-on-Kaggle
COVID 19 data visualization and time series analysis with animated charts using plotly.
Machinehack-Cardiac-Risk-Prediction
Plugin Hackathon : Cardiac Arrest Risk Prediction. Your objective as a data scientist is to build a machine learning model that can predict if a patient is likely to have a cardiac arrest or not.
Machinehack-E-Commerce-product-price-prediction
E-commerce Price Prediction: Weekend Hackathon #8. Given are 7 distinguishing factors that can influence the price of a product on an e-commerce platform. Your objective as a data scientist is to build a machine learning model that can accurately predict the price of a product based on the given factors.
Pyspark-Theory-and-Code-Basics
Pyspark serves as a Python interface to Apache Spark, enabling the execution of Python and SQL-like instructions for the manipulation and analysis of data within a distributed processing framework.