Nicola Tonellotto's repositories
data-science-ipython-notebooks
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
indexing-cw-with-terrier
Files to use for index Clueweb 09 and 12 collections with Terrier 4.2
progressbar
Terminal-based progress bar for Java/JVM
academicpages.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
adaqs
AdaQS: Adaptive QuickScorer for Sparse Data and Regression Trees with Default Directions
async-profiler
Sampling CPU and HEAP profiler for Java featuring AsyncGetCallTrace + perf_events
bert-axioms
Code for ECIR'20 paper Diagnosing BERT with Retrieval Heuristics
caffeine
A high performance caching library for Java 8
cayman
Cayman is a Jekyll theme for GitHub Pages
chato-notes
A LaTeX package for notes / todo / questions / answers. Useful while writing academic papers
creme
:custard: Online machine learning in Python
cs646_tutorials
A tutorial of Galago, Lucene, and other tools for UMass CS646 students.
Distributed-and-Cluster-Computing
Course materials for CSC 496, Special Topic in Complex Systems, at West Chester University of Pennsylvania
Federated-Learning-PyTorch
Implementation of Communication-Efficient Learning of Deep Networks from Decentralized Data
hpsa
Course Material Repository for the High Performance & Scalable Analytics course of the Master in Big Data Analytics & Social Mining
luwak
A java library for stored queries
ml-workspace
🛠 All-in-one web-based IDE specialized for machine learning and data science.
parallel_python
Parallel Programming with Python Tutorial
PartitionedEliasFano
Implementation of Partitioned Elias Fano compression algorithms
SearchEngine
A simple search engine that runs in a python notebook.
selective-search
Selective search partitions large scale dataset into subsets(shards) such that only few shards needs to be searched for a query, thus improving search efficiency and effectiveness
SIGIR19-BERT-IR
Repo of code and data for SIGIR-19 short paper "Deeper Text Understanding for IR with Contextual NeuralLanguage Modeling"
sklearn_tutorial
Materials for my scikit-learn tutorial
Text-Classification-Models-Pytorch
Implementation of State-of-the-art Text Classification Models in Pytorch
YCSB
Yahoo! Cloud Serving Benchmark
ytk-learn
Ytk-learn is a distributed machine learning library which implements most of popular machine learning algorithms(GBDT, GBRT, Mixture Logistic Regression, Gradient Boosting Soft Tree, Factorization Machines, Field-aware Factorization Machines, Logistic Regression, Softmax).