Diego's starred repositories
resume.github.com
Resumes generated using the GitHub informations
deeplearning4j
Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running math code and a java based math library on top of the core c++ library. Also includes samediff: a pytorch/tensorflow like library for running deep learn...
elasticsearch-dump
Import and export tools for elasticsearch & opensearch
open-source-search-engine
Nov 20 2017 -- A distributed open source search engine and spider/crawler written in C/C++ for Linux on Intel/AMD. From gigablast dot com, which has binaries for download. See the README.md file at the very bottom of this page for instructions.
datumbox-framework
Datumbox is an open-source Machine Learning framework written in Java which allows the rapid development of Machine Learning and Statistical applications.
annotorious-v1
Project has moved to http://github.com/annotorious/annotorious
brown-cluster
C++ implementation of the Brown word clustering algorithm.
quickscrape
A scraping command line tool for the modern web
java-dirty
File-backed append-only object store.
autocomplete
Solr advanced autocomplete example
opennlp-italian-models
Models for POS tagging and sentence and tokens detection with OpenNLP tools for italian language
italian-nlp-library
A library to run NLP tasks on Italian language
groupvarint
groupvarint: integer compression
dexter-hadoop
A small plugin to run Dexter annotations in Hadoop MapReduce environment
entityAnalysis
Analyzing entities in queries