There are 1 repository under rfe topic.
Use advanced feature engineering strategies and select best features from your data set with a single line of code. Created by Ram Seshadri. Collaborators welcome.
A Linear Regression model to predict the car prices for the U.S market to help a new entrant understand important pricing variables in the U.S automobile industry. A highly comprehensive analysis with detailed explanation of all steps; data cleaning, exploration, visualization, feature selection, model building, evaluation & MLR assumptions validity.
Implements an entire machine learning pipeline to train and evaluate a Random Forest Classifier on labeled gait data for walking. Data generated during the experiment has led to helpful insights in to the problem domain.
The given information of network connection, model predicts if connection has some intrusion or not. Binary classification for good and bad type of the connection further converting to multi-class classification and most prominent is feature importance analysis.
HR Analytics Dataset
Alignment-free method to identify and analyse discriminant genomic subsequences within pathogen sequences
Computer Intelligence subject final project at UPC.
A multiple linear regression model for the prediction of car prices.
Car Price Prediction
Predict the vehicle price from the open source Auto data set using linear regression. In this data set, we have prices for 205 automobiles, along with other features such as fuel type, engine type,engine size,etc.
Machine Learning Telecom Churn Model
King County House Sales
Building a model to predict demand of shared bikes. It will be used by the management to understand how exactly the demands vary with different features. They can accordingly manipulate the business strategy to meet the demand levels.
The goal of this project is to garner data insights using data analytics to purchase houses at a price below their actual value and flip them on at a higher price. This project aims at building an effective regression model using regularization (i.e. advanced linear regression: Ridge and Lasso regression) in order to predict the actual values of prospective housing properties and decide whether to invest in them or not.
Predictive model that tells important factors(or features) affecting the demand for shared bikes
To identify the variables affecting house prices :Multiple Linear Regression in Python using statsmodels and RFE
Build a logistic regression model to assign a lead score between 0 and 100 to each of the leads which can be used by the company to target potential leads. A higher score would mean that the lead is hot, i.e. is most likely to convert whereas a lower score would mean that the lead is cold and will mostly not get converted.
A US bike-sharing provider BoomBikes has recently suffered considerable dips in their revenues due to the ongoing Corona pandemic. The company is finding it very difficult to sustain in the current market scenario. So, it has decided to come up with a mindful business plan to be able to accelerate its revenue as soon as the ongoing lockdown comes to an end, and the economy restores to a healthy state.
Machine Learning Project
project for the practice of webscraping, APIs, machine learning, feature selection
Analyze customer-level data of a leading telecom firm, build predictive models to identify customers at high risk of churn (usage-based churn) and identify the main indicators of churn.
Bike Sharing in Washington D.C.
In this project, data analytics is used to analyze customer-level data of a leading telecom firm, build predictive models to identify customers at high risk of churn, and identify the main indicators of churn. The project focuses on a four-month window, wherein the first two months are the ‘good’ phase, the third month is the ‘action’ phase, while the fourth month is the ‘churn’ phase. The business objective is to predict the churn in the last i.e. fourth month using the data from the first three months.
Previsão de Fraude Financeiro
A telemarketing model to predict campaign subscriptions in a portuguese bank institution.
Multiclass classification model of penguins species.
Predicting the variables that effects the revenue of the bike sharing company after a serious drop-fall during the covid-19 pandemic.
Regression Model using regularisation to predict the actual value of the prospective properties and decide whether to invest in them or not.
Build a classification model for reducing the churn rate for a telecom company
In this project we built a model to predict whether a person will remain in a hypothetical trade union called the United Data Scientists Union (UDSU).
Consider a real estate company that has a dataset containing the prices of properties in the Delhi region. It wishes to use the data to optimise the sale prices of the properties based on important factors such as area, bedrooms, parking, etc. Essentially, the company wants — To identify the variables affecting house prices, e.g. area, number of rooms, bathrooms, etc. To create a linear model that quantitatively relates house prices with variables such as number of rooms, area, number of bathrooms, etc. To know the accuracy of the model, i.e. how well these variables can predict house prices.
Before training a model or feed a model, first priority is on data,not in model. The more data is preprocessed and engineered the more model will learn. Feature selectio one of the methods processing data before feeding the model. Various feature selection techniques is shown here.
[Codenation] Feature Selection w/ Recursive Feature Elimination (aka RFE) and Dimensionality Reduction using Principal Component Analysis (aka PCA)
Crafting static and dynamic models for data exfiltration detection via DNS traffic analysis. Static model trained on batch data, while dynamic model simulates a continuous stream. Rigorous analysis, feature engineering, and model training conducted. Implementation part of AI for Cyber Security Master's assignment at the University of Ottawa, 2023.
Bank Customer Behaviour Prediction