Randy Leon (randleon)



Location: Greater New York City Area


Randy Leon's repositories


assignments and projects for Yeshiva University's Katz School Information Architectures course, spring 2020

Language: Jupyter Notebook | Stargazers: 2 | Issues: 1 | Issues: 0


👋 Hi, I’m Randy! 👀 I’m interested in becoming a data scientist. 🌱 I’m currently learning Python, SQL, Tableau, and AWS. 💞️ I’m looking to collaborate on beginner to intermediate data science projects to showcase some skills! 👀 Some of my interests include weightlifting, geopolitics, and Yu-Gi-Oh the Card Game.


assignments and projects for Yeshiva University's Katz School Analytics Programming course, fall 2019

Language: Jupyter Notebook | Stargazers: 1 | Issues: 1 | Issues: 0


Sample dashboards to showcase my work in database and BI tools



Cleaning a wine data set using Python 3 in a Jupyter Notebook. Packages include Seaborn, NumPy, and Sklearn.

Language: Jupyter Notebook | Stargazers: 1 | Issues: 1 | Issues: 0


Naïve Bayes classifiers are widely recognized for their efficacy at classifying text data (e.g., sentiment analysis). Many organizations rely on sentiment analysis algorithms to help them gauge the opinions of both existing and potential customers. Applying sentiment analysis algorithms to online product/service reviews helps inform business decisions.

Language: Jupyter Notebook | Stargazers: 1 | Issues: 1 | Issues: 0


assignments and projects for Yeshiva University's Katz School Structured Data Management course, fall 2020


Final Project at YU



Decision trees and random forest models can both be very effective when applied to classification problems. In this example, we compared the trade-off between performance and complexity for the two models using Pandas and NumPy.

Language: Jupyter Notebook | Stargazers: 0 | Issues: 1 | Issues: 0


Linear regression project on automobile data, with model checks using k-fold cross-validation.

Language: Jupyter Notebook | Stargazers: 0 | Issues: 1 | Issues: 0
Language: Jupyter Notebook | License: GPL-3.0 | Stargazers: 0 | Issues: 1 | Issues: 0


assignments and projects for Yeshiva University's Katz School Structured Visual Design and Storytelling, fall 2019

Language: Jupyter Notebook | Stargazers: 0 | Issues: 1 | Issues: 0


Prepared the UCI Mushroom data for construction of predictive models. My team and I also cross-trained the models for accuracy and precision.

Language: Jupyter Notebook | Stargazers: 0 | Issues: 0 | Issues: 0


Of particular interest to most online retailers is whether or not a site visitor ends up making a purchase while engaged with the website. We used supervised learning methods such as K-nearest neighbors and support vector machines in Python to predict whether or not online shoppers would make a purchase.

Language: Jupyter Notebook | Stargazers: 0 | Issues: 1 | Issues: 0


Allocated columns of missing data to other workbooks. I transformed and concatenated the column data across workbooks using VLOOKUP, then combined all the tables into one new sheet for reference.



Data science project, written in Python 3 in a Jupyter Notebook, that applies feature selection/dimensionality reduction techniques to identify the explanatory variables to include in a linear regression model predicting the number of times an online news article will be shared.

Language: Jupyter Notebook | Stargazers: 0 | Issues: 1 | Issues: 0


Constructing and comparing a series of regression models that predict the number of student “dropouts” in a school dataset relative to certain properties/characteristics of a given school district and the associated student subgrouping.

Language: Jupyter Notebook | Stargazers: 0 | Issues: 1 | Issues: 0


Python project that used KNN and SVM models to classify insurance data found on Kaggle.com

Language: Jupyter Notebook | Stargazers: 0 | Issues: 1 | Issues: 0


Making a dynamic sales dashboard from sample sales data



Our team sought to perform sentiment analysis on tweets in anticipation of the 2019 release of Hideo Kojima's video game Death Stranding. We sourced the tweets from two libraries, preprocessed them, stored them using MongoDB, and then performed sentiment analysis.

Language: Jupyter Notebook | Stargazers: 0 | Issues: 1 | Issues: 0


Evaluating classification models is easier when performance metrics are combined with model performance evaluation graphics. The purpose of this exercise is to calculate a suite of classification model performance metrics using Python functions.

Language: Jupyter Notebook | Stargazers: 0 | Issues: 1 | Issues: 0
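
As a rough illustration of what calculating classification metrics with Python functions can look like, here is a minimal sketch. It is not taken from the repository: the function names `confusion_counts` and `classification_metrics`, the choice of metrics, and the sample labels are all assumptions made for this example, and it handles only the binary case.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Count true/false positives and negatives for a binary problem.

    Hypothetical helper: `positive` is the label treated as the positive class.
    """
    tp = fp = tn = fn = 0
    for actual, predicted in zip(y_true, y_pred):
        if predicted == positive:
            if actual == positive:
                tp += 1
            else:
                fp += 1
        else:
            if actual == positive:
                fn += 1
            else:
                tn += 1
    return tp, fp, tn, fn


def classification_metrics(y_true, y_pred, positive=1):
    """Return accuracy, precision, recall, and F1 as a dict.

    A metric whose denominator is zero is reported as 0.0 (an assumption;
    other conventions exist).
    """
    tp, fp, tn, fn = confusion_counts(y_true, y_pred, positive)
    total = tp + fp + tn + fn
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Made-up labels, purely for illustration.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]
metrics = classification_metrics(y_true, y_pred)
print(metrics)
```

Writing the counts out by hand like this makes each metric's definition explicit; in practice a library such as scikit-learn provides equivalent functions.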