Ransom's repositories
Retail-Supermarket-Store-Sales-Analysis
An analysis of a Giant retail supermarket chain's sales data for a period of 2.5 years using R to build a model for forecasting Sales
Andrew-NG-Notes
This is Andrew NG Coursera Handwritten Notes.
30-Days-Of-React-Fork
30 Days of React challenge is a step by step guide to learn React in 30 days. These videos may help too: https://www.youtube.com/channel/UC7PNRuno1rzYPb1xLa4yktw
California-Housing-Price-Prediction
A simple data science project on the California Housing Dataset with some Exploratory data analysis and use of Linear Regression models from sklearn and Seaborn plots
Capstone-Data-Science-Course-Project
This was a comprehensive project completed as part of the Data Science PG Programme. This covers classification algorithms over a dataset collected on health/diagnostic variables to predict of a person has diabetes or not based on the data points. Apart from extensive EDA to understand the distribution and other aspects of the data. Pre-processing was done to identify data which was missing or did not make sense within certain columns and imputation techniques were deployed to treat missing values. For classification the balance of classes was also reviewed and treated using SMOTE. Finally models were built and compared for accuracy on various metrics.Lastly the project contains a dashboard on the original data using Tableau
Machine-Learning-Algorithms
One notebook to learn it all - Algorithms from scratch
ml-berlin-tutorial
IPython notebooks and data for scikit-learn tutorial at the ML Berlin Meetup.
NLP-Course-Project-Review-Analysis-and-Topic-Modeling-with-LDA
This project includes analysis of buyer's reviews/comments of a popular mobile phone from an e-commerce website . Analysis done for the project include pre-processing of text data such as word-tokenisation, lemmatisation. Followed by Topic-modeling using Latent Dirichlet Allocation, POS tagging, and topic interpretation for business use
PCA-and-XGBoost-Regression-Mercedes-Benz-test-data
A project based on Mercedes Benz test bench data for vehicles at the testing and quality assurance phase. Data consists of high number of feature columns. Key highlights from the project include - Dimensionality reduction using PCA and XGBoost Regression used after the dimensionality reduction to predict the time required to test the vehicles.
Ransomk
Config files for my GitHub profile.
ransomk.github.io
A collection of my data analysis and data science projects