There is 1 repository under the deduplicate-data topic.
Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends
General deduping engine for JDBC sources with output to JDBC/CSV targets
Sort and deduplicate data.