Roman's repositories
customer-analytics-pydata-workshop-march-2018
Repo containing info and exercises about the customer analytics / pydata workshop in March 2018
Flask
Flask
serverless-python-sample
A simple serverless python sample with REST API endpoints and dependencies
python-machine-learning-book
The "Python Machine Learning" book code repository and info resource
DublinDataEngineering
The Open Source resources in Data Engineering, Machine Learning, Data Science areas, inspired by [The Open-Source Data Science Masters] (http://datasciencemasters.org/).
Two-Sigma-Connect-Rental-Listing-Inquiries
Two Sigma and RentHop, a portfolio company of Two Sigma Ventures, invited Kagglers to unleash their creative engines to uncover business value in this unique recruiting competition. RentHop makes apartment search smarter by using data to sort rental listings by quality. But while looking for the perfect apartment is difficult enough, structuring and making sense of all available real estate data programmatically is even harder. Two Sigma invited kagglers to apply their talents in this recruiting competition featuring rental listing data from RentHop. Kagglers will predict the number of inquiries a new listing receives based on the listing’s creation date and other features. Doing so will help RentHop better handle fraud control, identify potential listing quality issues, and allow owners and agents to better understand renters’ needs and preferences. Two Sigma has been at the forefront of applying technology and data science to financial forecasts. While their pioneering advances in big data, AI, and machine learning in the financial world have been pushing the industry forward, as with all other scientific progress, they are driven to make continual progress. This challenge is an opportunity for competitors to gain a sneak peek into Two Sigma's data science work outside of finance.
seaborn
Statistical data visualization using matplotlib
ThinkStats2
Text and supporting code for Think Stats, 2nd Edition
xgboost
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Flink and DataFlow
AWS_Lambda_in_Action
This source code distribution is a companion to the AWS Lambda in Action book available from Manning Publications.
Introduction-to-Git-and-GitHub
Git is the most popular distributed version control system. Git is an open source and it is created by the same people who developed Linux (Linus Torvalds). Git allows you to track files and file changes in a repository “repo” (folder). Everything is stored in local repositories on your computer. Git makes changes and tracks modification to the files stored in GitHub. Git synchronize repository and code between different people.
awesome-machine-learning
A curated list of awesome Machine Learning frameworks, libraries and software.
Data-Wrangling-with-R
Meetup
PythonDataScienceHandbook
Jupyter Notebooks for the Python Data Science Handbook
Shiny
Simple shiny app
Lending-Club
Analysis and modeling Lending Club datasets
movies-python-bolt
Neo4j Example application with flask backend using the neo4j-python-driver
ggthemes
ggplot themes and scales
LearnJava
Learn Java Programming
things
Logical puzzles, utils, snippets, things
RepData_PeerAssessment1
Peer Assessment 1 for Reproducible Research
DataScienceSpecialization.github.io
http://DataScienceSpecialization.github.io
scikit-learn
scikit-learn: machine learning in Python
Probabilistic-Programming-and-Bayesian-Methods-for-Hackers
aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)
statsintro
Introduction to Statistics
pandas
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more