Thibault Debatty's repositories
java-string-similarity
Implementation of various string similarity and distance algorithms: Levenshtein, Jaro-winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity ...
spark-knn-graphs
Spark algorithms for building k-nn graphs
java-graphs
Algorithms that build k-nearest neighbors graph (k-nn graph): Brute-force, NN-Descent,...
php-language-processing
A PHP library for language processing. Includes string distance function (Levenshtein, Jaro-Winkler,...), stemming, etc.
java-spamsum
A Java implementation of SpamSum / SSDeep
php-clustering
Clustering algorithms for PHP
java-datasets
Java library for parsing various datasets: ENRON email dataset, Wikipedia web pages, DBLP papers, Reuters news ...
php-vector-matrix
A PHP library for vectors and matrices algebra
laravel-resource-generator
A complete code generator for Laravel resources (includes fully working controller code, views etc.)
hadoop-clustering
Algorithms to perform clustering with Hadoop
hadoop-knn-graph
Hadoop implementation of KNN graph building algorithms (Brute force, NNDescent, NNCtph, ...)
php-odt2html
PHP library to convert Openoffice files (ODT) to HTML
spark-kmedoids
Spark implementation of k-medoids clustering algorithm
java-aggregation
Java implementation of aggregation operators: WA, OWA and WOWA
bbb-recorder
BigBlueButton recorder using puppeteer to export as webm or mp4 file & Live RTMP broadcasting
BlackWidow
A Python based web application scanner to gather OSINT and fuzz for OWASP vulnerabilities on a target website.
php-aggregation-operators
PHP implementations of Weighted Ordered Weighted Aggregation (WOWA), Ordered Weighted Averaging (OWA), etc.
php-data-structures
Data structures implemented in PHP : KDTree,...
php-deployer
Simple deployer for GIT projects
php-simple-html-dom-parser
PHP Simple HTML DOM Parser adaptation for Composer and PSR-0
sitespeed.io
Sitespeed.io is an open source tool that helps you monitor, analyze and optimize your website speed and performance, based on performance best practices advices from the coach and collecting browser metrics using the Navigation Timing API, User Timings and Visual Metrics (FirstVisualChange, SpeedIndex & LastVisualChange).
sparkpackage-maven-plugin
Maven plugin for publishing on spark-packages
vagrant-appindicator
Vagrant Application Indicator for Ubuntu