Ransom's repositories


An analysis of 2.5 years of sales data from a giant retail supermarket chain, using R to build a sales forecasting model



Handwritten notes from Andrew Ng's Coursera course.

Language: Jupyter Notebook, Stargazers: 1, Issues: 0, Issues: 0


The 30 Days of React challenge is a step-by-step guide to learning React in 30 days. These videos may help too: https://www.youtube.com/channel/UC7PNRuno1rzYPb1xLa4yktw



A simple data science project on the California Housing dataset, with exploratory data analysis, linear regression models from sklearn, and Seaborn plots

Language: Jupyter Notebook, Stargazers: 0, Issues: 0, Issues: 0
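
As a rough illustration of the linear regression idea behind the California Housing project, here is a minimal sketch of an ordinary least squares fit for a single feature. It is written in plain Python rather than sklearn, the function name `fit_line` is hypothetical, and the numbers are toy values, not taken from the dataset or the repository.

```python
def fit_line(xs, ys):
    """Ordinary least squares fit of y = slope * x + intercept for one feature."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of squared deviations of x, and the co-deviation of x and y
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Toy data lying exactly on y = 2x + 1 (illustrative only)
xs = [1.0, 2.0, 3.0, 4.0]
ys = [3.0, 5.0, 7.0, 9.0]
slope, intercept = fit_line(xs, ys)
print(slope, intercept)
```

With several features, a library model such as sklearn's `LinearRegression` would usually be the more practical choice.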


A comprehensive project completed as part of the Data Science PG Programme. It applies classification algorithms to a dataset of health and diagnostic variables to predict whether a person has diabetes. Alongside extensive EDA to understand the distribution and other aspects of the data, pre-processing identified values that were missing or did not make sense within certain columns, and imputation techniques were used to treat the missing values. The class balance was also reviewed and treated using SMOTE. Finally, models were built and compared on various accuracy metrics. The project also contains a Tableau dashboard built on the original data.

Language: Jupyter Notebook, Stargazers: 0, Issues: 1, Issues: 0
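
The SMOTE step mentioned in the diabetes project can be sketched as follows. This is a simplified illustration of the core idea (a synthetic minority sample is placed at a random point on the line between a minority sample and one of its nearest minority neighbours), not the repository's code and not the imbalanced-learn implementation. The function name `smote_sketch`, its parameters, and the sample points are all hypothetical.

```python
import random


def smote_sketch(minority, n_new, k=2, seed=0):
    """Create n_new synthetic points by interpolating between minority
    samples and one of their k nearest minority neighbours."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # Rank the other minority points by squared Euclidean distance to base
        others = [p for p in minority if p is not base]
        others.sort(key=lambda p: sum((a - b) ** 2 for a, b in zip(base, p)))
        neighbour = rng.choice(others[:k])
        gap = rng.random()  # interpolation factor in [0, 1)
        synthetic.append([a + gap * (b - a) for a, b in zip(base, neighbour)])
    return synthetic


# Toy minority-class points (illustrative only)
minority = [[1.0, 2.0], [1.5, 1.8], [2.0, 2.2], [1.2, 2.5]]
new_points = smote_sketch(minority, n_new=6)
print(len(new_points))
```

Because each synthetic point lies between two existing minority samples, it stays inside the region the minority class already occupies. Oversampling of this kind is normally applied to the training split only.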


One notebook to learn it all - Algorithms from scratch

Language: Jupyter Notebook, Stargazers: 0, Issues: 0, Issues: 0


IPython notebooks and data for scikit-learn tutorial at the ML Berlin Meetup.



This project analyses buyers' reviews and comments on a popular mobile phone from an e-commerce website. The analysis includes pre-processing of the text data (word tokenisation and lemmatisation), followed by topic modelling using Latent Dirichlet Allocation, POS tagging, and topic interpretation for business use

Language: Jupyter Notebook, Stargazers: 0, Issues: 0, Issues: 0


A project based on Mercedes Benz test bench data for vehicles at the testing and quality assurance phase. The data has a high number of feature columns. Key highlights include dimensionality reduction using PCA, followed by XGBoost regression to predict the time required to test the vehicles.

Language: Jupyter Notebook, Stargazers: 0, Issues: 0, Issues: 0


Config files for my GitHub profile.



A collection of my data analysis and data science projects
