xflr6 / duplicates

Duplicate file CSV report via SQLite database with file infos

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Duplicates

Build Codecov

This Python script walks the current directory creating an SQLite database with the path, size, and mtime for each file. After computing and adding the md5 checksum for all duplicate candidates (files with the same byte size), it generates a CSV giving the following infos for each file that has the same content (md5sum) as another file:

  • location (path relative to the initial directory)
  • md5sum, size, and mtime
  • name (basename) and ext (file extension)

Dependencies

About

Duplicate file CSV report via SQLite database with file infos

License:MIT License


Languages

Language:Python 100.0%