There are 1 repository under multicollinearity topic.
Small example on how you can detect multicollinearity
This is an attempt to summarize feature engineering methods that I have learned over the course of my graduate school.
Quadratic programming feature selection
This repository shows how Lasso Regression selects correlated predictors
A simple example to show how Principal Component Analysis can be used to Address Multicollinearity
Machine-learning models to predict whether customers respond to a marketing campaign
Detailed implementation of various regression analysis models and concepts on real dataset.
R package to manage multicollinearity in modeling data frames.
Linear regression on numerical attributes
The main objective of this project is to build a model to identify whether the delivery of an order will be late or on time.
Android malware detection using machine learning.
A Regression Exercise covering OLS & Ridge Regression
INN Hotels Project
This project aims to build a regression model that predicts the number of views for TED Talks videos on the TED website.
Analyze the data of INN Hotels to find which factors have a high influence on booking cancellations, build a predictive model that can predict which booking is going to be canceled in advance, and help in formulating profitable policies for cancellations and refunds.
This repo implements a machine learning model to predict real estate prices in Mexico City. It preprocesses data, incorporates one-hot encoding, imputation, and Ridge regression, achieving accurate price approximations.
To model the demand for shared bikes with the available independent variables
Statistical Multivariate Regression Analysis to determine the effects of mortality, economic and social factors on life expectancy.
ML | Regression Analysis| Random Forest| XGBoost| Gradient Boost| EDA| Feature Engineering| Feature selection
Analyzing Multicollineaerity with a little simulation
In this repo I have implemented a machine learning project which predicts the house price in Boston. I have covered these topics : Exploratory Data Analysis, Feature Engineering including feature scaling, transformation into normally distributed data, multicollinearity, feature selection. I have trained the dataset using Linear Regression, Ridge, Lasso, and Elastic Net Regression.
Traditional Regression problem project in Python
The main objective is to build a predictive model that predicts whether a new client will subscribe to a term deposit or not, based on data from previous marketing campaigns.
RStudio project utilizing various statistical methods to replicate and diagnose the findings of Appel and Loyle from their study on post-conflict justice and foreign direct investment.
Python with Tableau
Classification problem using multiple ML Algorithms
This project is about to use linear regression to examine the relationship between various economic variables and the mortgage rate in the United States.
This repository contains the code and data necessary to reproduce the results presented in the paper "Ridge Regularization for Spatial Auto-regressive Models with Multicollinearity Issues" submitted to Advances in Statistical Analysis (AStA).
Basic methodologies of Empirical Research applied on various case studies (R language)
Usual linear regression or XGBoost? Combo! Or how I was investigating the impact of intellectual capital on NASDAQ-100 capitalization during 2 years.
Analyze the data of INN Hotels to find which factors have a high influence on booking cancellations, build a predictive model that can predict which booking is going to be canceled in advance, and help in formulating profitable policies for cancellations and refunds
A simple Neural Network Model to predict the housing price based on the house features like bedrooms, area, etc. We are using kaggle Housing Prices Dataset. The data has multicollinearity prob
Research, Analysis, and Final Paper for my Intro to Econometrics class taken in Fall 2023
This project employs a dataset of 103,904 entries with 25 features. Utilizing the XGBoost classifier,The workflow involves data fetching, feature selection, preprocessing, correlation analysis, best feature selection, data rescaling, train-test split, and target balancing. Predicts whether a customer will experience satisfaction with a flight.