Piotr Goliasz's repositories
snowplow-twitter-tracker
Follows Twitter users and tweets where they are mentioned. Posts found tweets to collector.
pio-template-kmeans-clustering
PredictionIO kmeans clustering template. Designed for 2D points.
PythonDataScienceHandbook
Jupyter Notebooks for the Python Data Science Handbook
template-scala-parallel-svd-item-similarity
Prediction.IO template for item similarity measurement
awesome-bigdata
A curated list of awesome big data frameworks, ressources and other awesomeness.
couchbase-tools
Simple couchbase tools
Detectron
FAIR's research platform for object detection research, implementing popular algorithms like Mask R-CNN and RetinaNet.
dev-guide-sp-kafka-bq
code for talk developers guide for realtime pipeline with snowplow kafka and bigquery
doca
A CLI tool that scaffolds API documentation based on JSON HyperSchemas.
freegeoip
IP geolocation web server
kaggle-competitions
Code used for participation in kaggle competitions
kubernetes-elasticsearch-cluster
Elasticsearch cluster on top of Kubernetes made easy.
LSTM-Neural-Network-for-Time-Series-Prediction
LSTM built using Keras Python package to predict time series steps and sequences. Includes sin wave and stock market data
RLTrader
A cryptocurrency trading environment using deep reinforcement learning and OpenAI's gym
seismichadoop
System for performing seismic data processing on a Hadoop cluster.
sigrun
Pure java Seg-Y parser.
sl-quant
Companion code for the "Self Learning Quant" blog post
snowplow-flink-enriched2json
Example of Flink job translating snowplow enriched events to json format
snowplow-kafka-sink
Monitor kafka topic. Publish to snowplow tracker.
superset
Docker image for AirBnB's Superset
tsfresh
Automatic extraction of relevant features from time series:
vimeo.py
Official Python library for the Vimeo API.