There are 12 repositories under the duplicate-detection topic.
A plugin that does one thing only: Detect and manage duplicate items in Zotero.
Filter, Sort & Delete Duplicate Files Recursively
⚡ Check your npm modules for unused and duplicate dependencies fast
Interactive code for image similarity using SIFT algorithm
CLI utility to find near-duplicate images and remove all but the best copy.
Fast Near-Duplicate Image Search and Delete using pHash, t-SNE and KDTree.
Vidupe is a program that can find duplicate and similar video files. V1.211 was released on 2019-09-18; a Windows executable is available.
Find similar audio files easily
CLI tool that quickly checks whether your bundle contains multiple versions of the same package, looking only at package.json.
Detecting near-duplicate videos by aggregating features from intermediate CNN layers
Nextcloud Media Duplicate Collector application
Remove Duplicate Messages
An open-source library that leverages Python’s data science ecosystem to build powerful end-to-end Entity Resolution workflows.
A uniquely crafted image viewer and editor with options to organize files, and maintain large lists of image files for slideshows, dupes detection or other purposes.
OpenStaticAnalyzer is a source code analyzer tool, which can perform deep static analysis of the source code of complex systems.
Welcome to Snowman App – a Data Matching Benchmark Platform.
Apps to find duplicate files including same/similar images & videos (with computer vision/AI)
A basic duplicate image detection service using perceptual image hash functions and nearest neighbor search, implemented using faiss, fastapi, and imagehash
An End-to-End Evaluation Framework for Entity Resolution Systems
Engine for analysis of Siegfried export files and DROID CSV. The tool has three purposes: to break the export into its components and store them in a SQLite database; to create additional columns that augment the output where useful; and to query the SQLite database, outputting results in a readable form useful for analysis by researchers and archivists in digital preservation departments at memory institutions. The tool will find duplicates, unidentified files, blacklisted objects, character encoding issues, and more.
This Python package identifies duplicate files in a folder of interest.
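As a rough illustration of what exact-duplicate file detection in a folder can look like, here is a minimal Python sketch. It is not taken from any repository listed above. The function names `file_digest` and `find_duplicates` are hypothetical, and the approach (group files by size, then compare SHA-256 digests) is an assumption chosen for simplicity.

```python
import hashlib
import os
from collections import defaultdict


def file_digest(path, chunk_size=65536):
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def find_duplicates(root):
    """Return lists of paths under `root` whose contents are byte-identical."""
    # First pass: bucket regular files by size (cheap, no file reads).
    by_size = defaultdict(list)
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            if os.path.isfile(path) and not os.path.islink(path):
                by_size[os.path.getsize(path)].append(path)

    # Second pass: hash only files that share a size with another file.
    by_hash = defaultdict(list)
    for paths in by_size.values():
        if len(paths) < 2:
            continue  # a file with a unique size cannot have a duplicate
        for path in paths:
            by_hash[file_digest(path)].append(path)

    # Keep only groups that actually contain more than one file.
    return [sorted(group) for group in by_hash.values() if len(group) > 1]
```

Calling `find_duplicates("some/folder")` returns a list of groups, each a sorted list of paths with identical content. This catches only exact duplicates; the near-duplicate image, audio, and video tools in the list rely on perceptual hashing or learned features instead.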