Niccolo Becchi's repositories
elasticsearch-index-cloner
Simple java tool just to tranfer/copy/clone an elasticsearch index on different cluster using the REST endpoinds and migrating settings and mappings as well
AutoCrawler
Google, Naver multiprocess image web crawler (Selenium)
awesome-crawler
A collection of awesome web crawler,spider in different languages
aws-sdk-java
The official AWS SDK for Java.
coursera-intro-recommendation-systems
Programming assignments for Introduction to Recommendation Systems course on Coursera.org
datafactorytest
TestDataFactoryOnGitRepo
deepjazz
Deep learning driven jazz generation using Keras & Theano!
display-advertising-challenge
Criteo/Kaggle Competition of CTR prediction
docker-airflow
Docker Apache Airflow
docker-elk
The ELK stack powered by Docker and Compose.
elasticsearch-knapsack
Knapsack plugin is an import/export tool for Elasticsearch
elasticsearch-readonlyrest-plugin
Safely expose Elasticsearch REST API directly to the public
extruct
Extract embedded metadata from HTML markup
my-simple-skeleton-spark
Spark & Scala project skeleton
nutch-circle-ci
Apache Nutch is an extensible and scalable web crawler
puphpet
Vagrant/Puppet GUI
py-ms-cognitive
Thin wrapper for the Microsoft Cognitive Services
sample-pyspark-application
A sample PySpark application demonstrating how to bundle your Python dependencies on YARN
solr-recommender
Solr + Mahout Item-based recommender and cross-recommender
spark-scala-tutorial
A free tutorial for Apache Spark.
stanford-cs-221-artificial-intelligence
VIP cheatsheets for Stanford's CS 221 Artificial Intelligence
t-digest
A new data structure for accurate on-line accumulation of rank-based statistics such as quantiles and trimmed means
test-ci-google-cloud
TestingSetupCiOnGoogleCloud
test_cookiecutter
TestCookieCutterCi