legrandian's repositories
Oversampling-Techniques-for-Imbalanced-Data
Comparison of SMOTE, Borderline SMOTE, SVM SMOTE and Random Over-Sampler for highly imbalanced data.
Feature-Engineering-and-Selection-in-Python
Python code for examples from the book "Feature Engineering and Selection" by Max Kuhn and Kjell Johnson
Tesla-Valuation-and-Share-Price-Forecast-in-2025
Monte Carlo model in Python for estimating the share price for Tesla in 2025 based on ARK Invest's assumptions.
Association-Rules-Analysis-for-Heart-Disease-Data
Finding interesting relations in the patient data from the UCI heart disease database.
Conversion-Rate
Predict conversion rate and come up with recommendations for the product team and the marketing team to improve conversion rate.
COVID-19-Data-Tracker
Tracking how the coronavirus is spreading around the world.
Customer-Churn-Prediction
Churn prediction for customer retention. Based on Telco data.
Mind-Your-Units-In-A-B-Experiments
A simulation showing the effects of accounting for (or ignoring) the group structure in a group-randomized experiment.
Spanish-Translation-AB-Testing
A/B tests play a huge role in website optimization. Analyzing A/B tests data is a very important data scientist responsibility. Especially, data scientists have to make sure that results are reliable, trustworthy, and conclusions can be drawn.
Category-Encoding-Experiment
An experiment showcasing various encoding techniques on categorical variables with a lot of levels. The classic Titanic data set is used.
clv
customer lifetime value BG/NBD model
Employee-Retention
Predict when employees are going to quit by understanding the main drivers of employee churn.
Kazakhstan-Migration
D3.js visualization of international migration dynamics in Kazakhstan from 2000 to 2017.
Kazakhstan-Unemployment
D3.js visualization for the unemployment rate in Kazakhstan from 2001 to 2016.
KrishaScrapy
Scrapy tool for collecting Kazakhstan real estate data from http://krisha.kz.
Recommender-Systems-Showdown
Comparing error rates and model performance of popular recommendation system algorithms.
Shelter-Animal-Outcome-Predictor
Machine learning tool for predicting shelter animal outcome.
stattests
Source code to reproduce experiments from the article Practitioner’s Guide to Statistical Tests
Transfer-Learning-With-TensorFlow-Hub
Image classification of flowers using transfer learning based on Inception V3.
Undersampling-Techniques-Comparison-for-Imbalanced-Data
This is an experiment that compares performance and visualizes various undersampling techniques for highly imbalanced data.
Virtual-Realtor
Whatsapp bot for estimating prices of apartments in Kazakhstan.