knn-imputer

There are 1 repository under knn-imputer topic.

miriamspsantos / heterogeneous-distance-functions
A collection of heterogeneous distance functions handling missing values.
distance-functions distance-measures heom heterogeneous-data hvdm knn knn-imputation knn-imputer machine-learning missing-data missing-values research-paper
Language:MATLAB 5
SebastianRokholt / Data-Science-Projects
A repository for various Data Science projects I've worked on, both university-related and in my spare time.
machine-learning web-science network-analysis gephi data-science pandas matplotlib selenium python xgboost knn-imputer mplfinance house-price-prediction multinomial-naive-bayes stochastic-gradient-descent deep-learning pytorch svm backpropagation data-science-projects
Language:Jupyter Notebook 3
SINGHxTUSHAR / Sensor-Fault-Detection
Data fetched by wafers is to be passed through the machine learning pipeline and it is to be determined whether the wafer at hand is faulty or not apparently obliterating the need and thus cost of hiring manual labour.
deployment-docs gradientboostingclassifier pipelines randomforrestclassifier svc-model xgbclassifier classification-algorithm flask-api knn-imputer simple-imputer
Language:Jupyter Notebook 3
ZL63388 / data-preparation-codes
This repository is a collection of basic code templates for Data Preparation. All codes I am sharing are from the practical exercises I did from the Data Science Infinity Program.
pandas simpleimputer knn-imputer onehot-encoding outlier-detection feature-scaling feature-selection
Language:Python 2
zuhaib1214 / Feature-Engineering
This repository is totally focused on Feature Engineering Concepts in detail, I hope you'll find it helpful.
binarization feature-engineering labelencoder normalisation onehot-encoding ordinal-encoding percentile-method principal-component-analysis simpleimputer standardization winsorization z-score discritisation mean-median-imputation iterative-imputer knn-imputer frequent-value-imputation
Language:Jupyter Notebook 1
Allen-Ho-0302 / First-Time-Eligible-Arbitration-Salary-Prediction
Modelling the relationship between a player’s first-time eligible arbitration salary and multiple variables.
heatmap-visualization knn-imputer lightgbm-regressor python random-forest smogn
Language:Jupyter Notebook 0
AyushTyagi1610 / Credit-Risk-Modelling
Built a model to determine the risk associated with extending credit to a borrower. Performed Univariate and Bivariate exploration using various methods such as pair-plot and heatmap to detect outliers and to monitor the behaviour and correlation of the features. Imputed the missing values using KNN Imputer and implemented SMOTE to address the imbalanced data. Trained the model using KNN, Decision Trees, Logistic Regression and Random Forest to achieve the best accuracy of 93%.
datacleaning knn-imputer pandas python random-forest smote-oversampler visualisation
Language:Jupyter Notebook 0
bortch / second_hand_UK_car_challenge
Kaggle UK Used Car challenge
kaggle knn-imputer machine-learning random-forest
Language:Python 0
dfavenfre / customer_deposit_classifier
Streamlit app developed for bank customer deposit prediction, using a fine-tuned XGBClassifier model.
banking finance knn-imputer rfecv smote xgboost-classifier
Language:Jupyter Notebook 0
MariaDimopoulou / Churn-Prediction-Customer-Segmentation-in-E-Commerce
This project focuses on predicting customer churn in an e-commerce setting using machine learning techniques.
classification clustering dbscan kmeans-clustering knn-imputer matplotlib pandas pca roc-curve seaborn silhouette-score smote tsne xgboost
Language:Jupyter Notebook 0
nf-i / data-imputation-python
Data imputation is used when there are missing values in a dataset. It helps fill in these gaps with estimated values, enabling analysis and modeling. Imputation is crucial for maintaining dataset integrity and ensuring accurate insights from incomplete data.
data-imputation knn-imputer python simple-imputer sklearn mice-imputer sklearn-impute
Language:Python 0
HousePricePrediction
NMARGOS / HousePricePrediction
[Kaggle Submission] -Using XGBRegressor with shap, grid search and hyperopt to predict house prices
data-cleaning data-science gridsearchcv hyperopt knn-imputer machine-learning-algorithms pandas-python shap supervised-learning xgboost-regression
Language:Jupyter Notebook 0
ntyblco / ML_Prediction_RF_KNN
Predicting employee burnout using machine learning algorithms: Random Forest and k-Nearest Neighbors.
burnout knn-imputer knn-regression machine-learning random-forest
0
Seghelicious / Cars45
regression preprocessing correlation-coefficient model-development knn-regressor random-forest-regressor knn-imputer data-cleaning grid-search-cv standard-scaler normalization standardization log-transform cross-validation pipelines extreme-values
Language:Jupyter Notebook 0
whoisksy / predict-home-loan-sanction-amount
regression-models machine-learning-algorithms knn-imputer loan-prediction-analysis missing-values visualization data-preprocessing
Language:Jupyter Notebook 0
ZG3Z / BTS-Weather-Clustering
clustering dataintegration geolocation knn-imputer nominatim plotly-express aglomerative-hierarchical-clustering kmeans-clustering
Language:Jupyter Notebook 0
Gui-Sitton / Zyfra
The company develops efficiency solutions for heavy industry. The model should predict the amount of pure gold extracted from gold ore. You have the data on extraction and purification. The model will help optimize production and eliminate unprofitable parameters.
data-science knn-imputer knn-regression machine-learning predictive-modeling python
Language:Jupyter Notebook
HuzeyfeAyaz / Knn-Imputer-With-Hamming-Distance
Filling missed data-points with the most common values among nearest neighbors
hamming hamming-distance knn knn-imputer python python3
Language:Python
kritika755 / wafer_circleci
This flask web app is used to detect if a wafer(sensor chip) is default or not based on sensor readings.
python flask machine-learning-algorithms knn-imputer xgboost random-forest numpy pandas sqlite heruko circleci docker
Language:Python
nani757 / multivariate-analysis
the multivariate analysis compares different rows and columns for beat accuracy eg:knn imputer in univariate analysis it only compares with the same columns eg mean or median for numbers
mice-algorithm knn-imputer iterative-imputer
Language:Jupyter Notebook
YaserEleraky / Aviation-Accident-NTSB-The-National-Transportation-Safety-Board
Analysis about Accident Aviation from 1962 up to 2023
accident aviation knn-imputer pandas plotly-python python scraping-web
Language:Jupyter Notebook
YD5463 / TabularDataProject
we perpuse a method to fill nan values using clustering
clustering dbscan-clustering knn-imputer python
Language:Jupyter Notebook

knn-imputer

miriamspsantos / heterogeneous-distance-functions

SebastianRokholt / Data-Science-Projects

SINGHxTUSHAR / Sensor-Fault-Detection

ZL63388 / data-preparation-codes

zuhaib1214 / Feature-Engineering

Allen-Ho-0302 / First-Time-Eligible-Arbitration-Salary-Prediction

AyushTyagi1610 / Credit-Risk-Modelling

bortch / second_hand_UK_car_challenge

dfavenfre / customer_deposit_classifier

MariaDimopoulou / Churn-Prediction-Customer-Segmentation-in-E-Commerce

nf-i / data-imputation-python

NMARGOS / HousePricePrediction

ntyblco / ML_Prediction_RF_KNN

Seghelicious / Cars45

whoisksy / predict-home-loan-sanction-amount

ZG3Z / BTS-Weather-Clustering

Gui-Sitton / Zyfra

HuzeyfeAyaz / Knn-Imputer-With-Hamming-Distance

kritika755 / wafer_circleci

nani757 / multivariate-analysis

YaserEleraky / Aviation-Accident-NTSB-The-National-Transportation-Safety-Board

YD5463 / TabularDataProject