Benita Diop (BenitaDiop)

BenitaDiop

Geek Repo

Location:NY, SF

Home Page:BenitaDiop.com

Github PK Tool:Github PK Tool

Benita Diop's repositories

FullStackBigData-with-SPARK

Pulled 10GB ofYelp Business data through the terminal via Kaggle API. The data was then pushed to and AWS S3 Bucket bucket for storage and analyzed on a Elastic MapReduce Cluster on a Jupyter Notebook using PySpark

Language:Jupyter NotebookStargazers:2Issues:2Issues:0

MiniBOONEClassification

In this dataset, we have 130K observations with 50 features. The features are measurements of Cherenkov light and scintillation light using hit topology and timing. There are 36.5K observations for electron neutrinos and 93.5K observations for muon neutrinos, which yields an imbalance ratio of 0.39

Language:RStargazers:1Issues:2Issues:0
Language:Jupyter NotebookStargazers:1Issues:0Issues:0
Stargazers:0Issues:0Issues:0

AnalysisOfCrimeInIndia

The dataset that I am performing this regression analysis on, comes from Kaggle, titled crimes In India. This dataset holds complete information about various aspects of crimes that have taken place in India in a 17 year span, from 2001 to 2018.

Language:RStargazers:0Issues:2Issues:0
Language:SASStargazers:0Issues:2Issues:0

PythonMicroserviceDeployment_SocrataAPI

In this project I leveraged Socrata OPCV API data to build a pipeline of logs from Docker Container to the Elasticsearch, Kibana Stack where data was collected, analyzed and transformed into visuals. Scripts were written in python and polished to be able to take in command line arguments from UNIX/LINUX operating systems. The scripts were tested for reproducibility by provisioning, configuring and executing an AWS EC2 instance which ran on a Docker container, read-in the python script, parsed in parameters from the command line and pull the API JSON logs. Additionally Git was utilized to maintain version control and prevent the confliction of concurrent work.

Language:PythonStargazers:0Issues:2Issues:0

StatisticalHypothesisTesting

Hypothesis testing four datasets on SAS using Hotelling t-squared hypothesis testing tool to validate all parametric estimates and to conclude if to accept or to reject the given hypothesis

Language:SASStargazers:0Issues:0Issues:0

coding-interview-university

A complete computer science study plan to become a software engineer.

License:CC-BY-SA-4.0Stargazers:0Issues:1Issues:0

courses-introduction-to-sql

Introduction to SQL by Nick Carchedi

Language:PythonStargazers:0Issues:1Issues:0

data

Dataset collection

Stargazers:0Issues:2Issues:0

DataStructures-Algorithms

Master Data Structures & Algorithms With Me =]

Stargazers:0Issues:2Issues:0
Language:Jupyter NotebookLicense:NOASSERTIONStargazers:0Issues:1Issues:0

DesignPatterns101

Lets Fall In Love With Design Patterns Together !

Stargazers:0Issues:0Issues:0
Language:ShellStargazers:0Issues:0Issues:0

docs

TensorFlow documentation

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:1Issues:0
Stargazers:0Issues:2Issues:0
Language:Jupyter NotebookLicense:GPL-2.0Stargazers:0Issues:1Issues:0
Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

Linear-Algebra

Linear Algebra

Language:Jupyter NotebookLicense:MITStargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:0Issues:0
Language:SASStargazers:0Issues:0Issues:0

NaturalScienceSeminar

Returning to my alma mater to give a talk on statistics and data science. This repo is to provide attendees of the Natural Science Seminar with all the material covered during the talk.

Language:RStargazers:0Issues:1Issues:0

OOP-in-Python

Master Object Oriented Programming in Python With Me =]

Stargazers:0Issues:2Issues:0
Language:RLicense:MITStargazers:0Issues:2Issues:0

Python

All Algorithms implemented in Python

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

python_advanced

Μίνι σειρές στην Python (έπονται της βασικής σειράς)

Language:PythonStargazers:0Issues:1Issues:0

Regression_TensorFlow

Ordinary least squares, Polynomial Regression, General linear model methodologies and cross validation

Stargazers:0Issues:2Issues:0

repo-info

Extended information (especially license and layer details) about the published Official Images

Language:PerlLicense:Apache-2.0Stargazers:0Issues:1Issues:0

sql

Youtube Tutorial - SQL

Stargazers:0Issues:1Issues:0