There are 0 repository under residual-analysis topic.
The goal of this project is to build multiple linear regression models for the prediction of car prices.
Assignment-05-Multiple-Linear-Regression-2. Prepare a prediction model for profit of 50_startups data. Do transformations for getting better predictions of profit and make a table containing R^2 value for each prepared model. R&D Spend -- Research and devolop spend in the past few years Administration -- spend on administration in the past few years Marketing Spend -- spend on Marketing in the past few years State -- states from which data is collected Profit -- profit of each state in the past few years.
Multi-Linear-Reg
Supervised-ML---Multiple-Linear-Regression---Toyota-Cars. EDA, Correlation Analysis, Model Building, Model Testing, Model Validation Techniques, Collinearity Problem Check, Residual Analysis, Model Deletion Diagnostics (checking Outliers or Influencers) Two Techniques : 1. Cook's Distance & 2. Leverage value, Improving the Model, Model - Re-build, Re-check and Re-improve - 2, Model - Re-build, Re-check and Re-improve - 3, Final Model, Model Predictions.
Residual analysis in Linear regression is based on examination of graphical plots which are as follows :: 1. Residual plot against independent variable (x). 2. Residual plot against independent variable()y. 3. Standardize or studentized residual plot 4. Normal probability plot
To model the demand for shared bikes with the available independent variables
Prediction of Miles per gallon (MPG) Using Cars Dataset
A MATLAB program related to Regression Models
Predicting the Likelihood of Diabetes Using Common Signs and Symptoms - About one-third of patients with diabetes do not know that they have diabetes according to the findings published by many diabetes institutes around the world. Detecting and treating diabetes patients at early stages is critical in order to keep them healthy and to ensure their quality of life is not compromised. Early detection will also help to mitigate the risk of serious complications like heart disease & stroke, blindness, limb amputations, and kidney failures as a result of diabetes. The data set consists of signs and symptoms of 516 newly diabetic or would be diabetic patients, who presented at Sylhet Diabetes Hospital in Sylhet, Bangladesh. The data had been collected using the direct questionnaires method at the hospital under the supervisor of Doctors. The Source for the data set is the UCI Machine Learning Repository at, https://archive.ics.uci.edu/ml/datasets/Early+stage+diabetes+risk+prediction+dataset. The data set has 16 descriptive features and one target feature. This study intends to build a logistic regression model to predict the likelihood of having diabetes using common signs and symptoms presented by patients. A successful model will enable early detection of diabetes through signs and symptoms shown by possible patients. This study consists of two phases: 1) Phase I - preprocess and explore the data set in order to make it ready to consume for model development. 2) Phase II - build a logistic regression model to predict the likelihood of having diabetes based on signs and symptoms. The Phase I part has already been completed under previous work/submission and this report intends to cover the work carried out for Phase II. All the activities have been performed in the R package and the report has been compiled using R-Markdown.
Supervised-ML---Multiple-Linear-Regression---Cars-dataset. Model MPG of a car based on other variables. EDA, Correlation Analysis, Model Building, Model Testing, Model Validation Techniques, Collinearity Problem Check, Residual Analysis, Model Deletion Diagnostics (checking Outliers or Influencers) Two Techniques : 1. Cook's Distance & 2. Leverage value, Improving the Model, Model - Re-build, Re-check and Re-improve - 2, Model - Re-build, Re-check and Re-improve - 3, Final Model, Model Predictions.
The "Advertising Impact Analysis" project aims to analyze the relationship between advertising expenditure across different channels (such as TV, radio, online) and its impact on sales or revenue.
Used libraries and functions as follows:
A boat-sharing system is a service in which boats are made available for city tour. Required to model the demand for open boats with the available independent variables. It will be used by the management to understand how exactly the demands vary with different features. They can accordingly manipulate the business strategy to meet the demand levels and meet the customer's expectations. Further, the model will be a good way for management to understand the demand dynamics of a new market.
Prepare a prediction model for profit of 50_startups data. Do transformations for getting better predictions of profit and make a table containing R^2 value for each prepared model. R&D Spend -- Research and devolop spend in the past few years Administration -- spend on administration in the past few years Marketing Spend -- spend on Marketing in t
The goal of the report was to fit the linear regression model to the data and check whether the data met the assumptions of the model. The results were used to make predictions.
Anomaly detection for building HVAC data.
Practices and Assignments from the Data Analysis and Regression Class
Topic : Predicting Medal Counts by Countries in Upcoming Olympics Games // # Integrated historical Olympic data with demographic, health, and economic datasets to generate a large dataset # Developed a linear regression model displaying prediction accuracy close to 70%, within the margin of error
Feasibility of staring a Sunday edition for a large Metroplitan newsapaper
This repository contains a project I completed for an NTU course titled CB4247 Statistics & Computational Inference to Big Data. In this project, I applied regression and machine learning techniques to predict house prices in India.
Time Series Forecasting