sachinyar

sachinyar

Geek Repo

Github PK Tool:Github PK Tool

sachinyar's repositories

ipython_notebooks

Code snippets for reference

Stargazers:0Issues:0Issues:0

ML-mastery

Code from Jason Brownlee's course on mastering machine learning

Language:PythonStargazers:0Issues:0Issues:0

flight-data-analysis

US Flight Data Analysis from January 2016

Language:Jupyter NotebookStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

Simple-k-Means-Clustering-Python

Simple k-means clustering (centroid-based) using Python

Stargazers:0Issues:0Issues:0

movie-freak

A small movie recommendation system using OMDB API's

Language:PythonStargazers:0Issues:0Issues:0

SparkCourse

Taming Big Data with Apache Spark and Python - Hands On - Udemy

Language:PythonStargazers:0Issues:0Issues:0

spark_airline_delays

Rehash of HDP popular Predicting Airline Delays project

Language:Jupyter NotebookStargazers:0Issues:0Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0

Udemy---Machine-Learning

Notebooks for Course

Language:Jupyter NotebookStargazers:1Issues:0Issues:0

awesome-machine-learning

A curated list of awesome Machine Learning frameworks, libraries and software.

Language:PythonLicense:CC0-1.0Stargazers:1Issues:0Issues:0

awesome-datascience

:memo: An awesome Data Science repository to learn and apply for real world problems.

License:MITStargazers:0Issues:0Issues:0

Machine-Learning-Tutorials

machine learning and deep learning tutorials, articles and other resources

License:CC0-1.0Stargazers:0Issues:0Issues:0

DataSciencePython

common data analysis and machine learning tasks using python

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Udemy-notes

My udemy notebooks

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

Axa-Insurance-Telematics-Kaggle

I developed this case study only in 7 days with Pyspark (Spark 1.6.0) SQL & MLlib. I used Databricks cluster and AWS. %90 AUC is achieved (without involving Trip Matching-Repeated Trips feature) with Random Forest. Many ensembles with RF, GBT and Logistic Regression and outlier elimination could be used to improve this result. There are two versions of my code (test and full execution). Since AWS costs have exceeded my budget I sopped to train my model(s) all dataset for full dataset execution. There is also a ppt that presents my outputs in test execution. Full Data Execution code is more production ready and slightly different version. I had to use Databricks Table Caching to TRAIN and TEST data tables to obtain acceptable performance in production ready version.

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

DAT8

General Assembly's 2015 Data Science course in Washington, DC

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

mGalarnyk.github.io

Simple website for now using Github.

Language:HTMLStargazers:0Issues:0Issues:0

ds-for-telco

Source material for Data Science for Telecom Tutorial at Strata Singapore 2015

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

PCF-demo

System.exit

Language:JavaScriptStargazers:0Issues:0Issues:0

nyc-flights-analysis

Exploratory analysis bringing to bear all of my new skills in data manipulation and visualization in python.

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

building-spark-applications-live-lessons

Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable Spark applications for predictive analytics in the context of a data scientist's standard workflow.

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

DataScienceCourse

This holds iPython notebooks and lecture slides for the Intro to Data Science Master's course I teach at NYU.

Language:Jupyter NotebookStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0
Language:HTMLLicense:GPL-2.0Stargazers:0Issues:0Issues:0

Statistics-Notes

iPython NOtebooks on Stats

Language:Jupyter NotebookStargazers:0Issues:0Issues:0