Repositories under the data-sketching topic:
A Clojure library for querying large data sets by similarity
Routines and data structures for using isarn-sketches idiomatically in Apache Spark
Sketching data structures for Scala, including t-digest
ExaLogLog: Space-Efficient and Practical Approximate Distinct Counting up to the Exa-Scale
This project aims to expose the Yahoo Theta Sketch API as Spark SQL UDFs
UltraLogLog: A Practical and More Space-Efficient Alternative to HyperLogLog for Approximate Distinct Counting
A bare-bones implementation of the SimHash data sketching algorithm
Type classes to interface isarn-sketches with Algebird