tokiran's repositories
Data-Science--Cheat-Sheet
Cheat Sheets
Data-Science-Competitions
Goal of this repo is to provide the solutions of all Data Science Competitions(Kaggle, Data Hack, Machine Hack, Driven Data etc...).
Hands-On-Ensemble-Learning-with-Python
Hands-On Ensemble Learning with Python, published by packt publishing
stockpredictionai
In this noteboook I will create a complete process for predicting stock price movements. Follow along and we will achieve some pretty good results. For that purpose we will use a Generative Adversarial Network (GAN) with LSTM, a type of Recurrent Neural Network, as generator, and a Convolutional Neural Network, CNN, as a discriminator. We use LSTM for the obvious reason that we are trying to predict time series data. Why we use GAN and specifically CNN as a discriminator? That is a good question: there are special sections on that later.
ad_examples
A collection of anomaly detection methods (iid/point-based, graph and time series) including active learning for anomaly detection/discovery, bayesian rule-mining, description for diversity/explanation/interpretability. Analysis of incorporating label feedback with ensemble and tree-based detectors. Includes adversarial attacks with Graph Convolutional Network.
Artificial-Intelligence
Awesome AI Learning with +100 AI Cheat-Sheets, Free online Books, Top Courses, Best Videos and Lectures, Papers, Tutorials, +99 Researchers, Premium Websites, +121 Datasets, Conferences, Frameworks, Tools
awesome-business-intelligence
Actively curated list of awesome BI tools. PRs welcome!
awesome-streaming
a curated list of awesome streaming frameworks, applications, etc
data-science-on-gcp
Source code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
DataSphereStudio
DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
ibm-serverless-workshop
Build serverless applications with Apache OpenWhisk
Machine-Learning-Workflow-with-Python
This is a comprehensive ML techniques with python: Define the Problem- Specify Inputs & Outputs- Data Collection- Exploratory data analysis -Data Preprocessing- Model Design- Training- Evaluation
machine_learning_examples
A collection of machine learning examples and tutorials.
MorphL-Community-Edition
MorphL Community Edition uses big data and machine learning to predict user behaviors in digital products and services with the end goal of increasing KPIs (click-through rates, conversion rates, etc.) through personalization
Movalytics-Data-Warehouse
Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow
practicalAI
📚 A practical approach to machine learning.
project-based-learning
Curated list of project-based tutorials
PySpark
PySpark functions and utilities with examples. Assists ETL process of data modeling
pyspark-example-project
Example project implementing best practices for PySpark ETL jobs and applications.
pytorch-tutorial
PyTorch Tutorial for Deep Learning Researchers
Spark-with-Python---My-learning-notes-
ETL pipeline using pyspark (Spark - Python)
spark_scala_ml_examples
Spark 2.0 Scala Machine Learning examples
TransmogrifAI
TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning
UdacityOpenSource
A repository to keep all open sources projects that created by individuals or study groups.