kristiankoskimaki / vidupe

Vidupe is a program that can find duplicate and similar video files. V1.211 released on 2019-09-18, Windows exe here:

Home Page:https://github.com/kristiankoskimaki/vidupe/releases

About improving recognition rate

m1517 opened this issue · comments

commented

Hello, this software is great.
I ran into a problem while using it and hope it can be improved. Some of my videos have ads added at the beginning, which changes their length and makes it impossible to detect them as duplicates. Could you add a button that extracts the image from the end of the video and compares that instead? ^_^ Thank you.

I will upload a new version that defaults to comparing the beginning and end separately, so a different intro or ending does not interfere. It now finds many more matches.

This is really the program's Achilles heel: If two videos differ by mere seconds, the comparison usually fails. But the comparison must be simple or the program grinds to a halt when comparing thousands of files.

As I understand it, the program compares one whole group of screenshots (say 2x2) with another whole group. Could we add an option for thumbnail generation and comparison to capture each screenshot individually and compare it to other single screenshots? That would greatly increase the number of comparisons, but it seems like thumbnail generation takes the most CPU time anyway.

Good ideas, but having tested for a day I can now say that:
-CutEnds mode (comparing the first and last screens individually) from v1.21 finds many more matches than the old way of combining screens into a single image.
-None of the other thumbnail modes find more results, so they are effectively useless now.
-Comparing just the first and last screens is enough; more than that never finds additional matches.
-Comparing each of the 12 screens from video1 with all 12 screens from video2 is indeed very slow and only finds a few additional false positives.
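
The CutEnds idea in the list above can be sketched roughly as follows. This is only an illustration, not vidupe's actual code: the average-hash fingerprint, the `max_distance` threshold, and every function name here are assumptions chosen for the example. Two videos count as a match if either their first screens or their last screens are close enough, so a different intro or ending on one side does not hide the match.

```python
from typing import List

Frame = List[List[int]]  # hypothetical grayscale thumbnail, pixel values 0-255


def average_hash(frame: Frame) -> int:
    """Fingerprint a frame: one bit per pixel, set when the pixel is above the mean."""
    pixels = [p for row in frame for p in row]
    mean = sum(pixels) / len(pixels)
    bits = 0
    for p in pixels:
        bits = (bits << 1) | (1 if p > mean else 0)
    return bits


def hamming(a: int, b: int) -> int:
    """Number of differing bits between two fingerprints."""
    return bin(a ^ b).count("1")


def cut_ends_match(first_a: Frame, last_a: Frame,
                   first_b: Frame, last_b: Frame,
                   max_distance: int = 1) -> bool:
    """Match if the first screens OR the last screens are similar (assumed rule)."""
    same_start = hamming(average_hash(first_a), average_hash(first_b)) <= max_distance
    same_end = hamming(average_hash(last_a), average_hash(last_b)) <= max_distance
    return same_start or same_end


# Toy 2x2 frames: video B starts with an ad but ends like video A.
scene_start = [[10, 200], [10, 200]]
scene_end = [[10, 10], [200, 200]]
intro_ad = [[200, 200], [10, 10]]
scene_end_reencoded = [[12, 11], [198, 201]]

print(cut_ends_match(scene_start, scene_end, intro_ad, scene_end_reencoded))
```

With this rule each pair of videos needs only 2 frame comparisons, whereas comparing all 12 screens of one video against all 12 of the other needs 12 * 12 = 144.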

commented

Hi, I've been using it for a while and it really does find more duplicates. I went through the videos it still couldn't match and found that most of them had different durations.

I learned some ffmpeg and read your source code, but I don't understand C++. I did notice that you take screenshots at percentages of the movie's duration. The problem is that for videos with different durations, the screenshots land at different positions, which lowers accuracy.
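
A small worked example of that effect, with made-up numbers: a 600-second video and a copy of it with a 30-second ad prepended. The durations and function names are hypothetical; only the idea of sampling at a percentage of the duration comes from the comment above.

```python
def percent_position(duration_s: float, percent: float) -> float:
    """Timestamp of a screenshot taken at a percentage of the duration."""
    return duration_s * percent / 100.0


def end_relative_position(duration_s: float, seconds_before_end: float) -> float:
    """Timestamp of a screenshot taken a fixed time before the end."""
    return duration_s - seconds_before_end


original = 600.0          # assumed length of the clean video, in seconds
ad = 30.0                 # assumed length of the ad added at the beginning
with_ad = original + ad

# Express every timestamp in terms of the original content by subtracting the ad.
percent_in_original = percent_position(original, 10)
percent_in_copy = percent_position(with_ad, 10) - ad
drift = percent_in_original - percent_in_copy

end_in_original = end_relative_position(original, 60)
end_in_copy = end_relative_position(with_ad, 60) - ad

print("drift at 10%:", drift, "seconds")
print("end-relative positions:", end_in_original, end_in_copy)
```

In this example the 10% screenshots are taken 27 seconds apart in the underlying content, while the screenshots taken 60 seconds before the end show the same moment in both files.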

In my tests, seeking with ffmpeg -sseof -60 (60 seconds before the end of the file) is not affected by the duration, and a single picture is enough. Could you change the 1x1 option to work this way? I don't know how much trouble the modification would be. Anyway, thank you! I only know Chinese, so this is all I can write.
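
For reference, a minimal sketch of how such an end-relative capture could be invoked. The helper name, file names, and the choice to build the command in Python are assumptions for illustration; vidupe itself may call ffmpeg differently. `-sseof` is an input option, so it has to come before `-i`.

```python
from typing import List


def tail_frame_command(video: str, image: str, seconds_before_end: int = 60) -> List[str]:
    """Build an ffmpeg command that grabs one frame a fixed time before the end."""
    return [
        "ffmpeg",
        "-sseof", f"-{seconds_before_end}",  # seek relative to the end of the input
        "-i", video,
        "-frames:v", "1",                    # write a single video frame
        "-y",                                # overwrite the output image if it exists
        image,
    ]


cmd = tail_frame_command("movie.mp4", "tail.jpg")
print(" ".join(cmd))
# To actually run it (requires ffmpeg on PATH):
#   import subprocess; subprocess.run(cmd, check=True)
```

Videos shorter than the chosen offset would probably need separate handling, for example a smaller offset.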