ui-libraries / flint

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

mismatched number of senders and UTS

hzadeh17 opened this issue · comments

This is a kink to be worked out in the rounding and sender-matching method for finding duplicates.

In order for this method to work, the functions must find the same number of senders and UTS because they are simply matched up by enumeration.

Right now, there are 547 out of 10172 files (bookmarks for bookmarks with <10 pgs, or pgs for bookmarks with >11 pgs), or about 5.4% where there is a mismatched number. I got the number down from 818 just by going through and fixing problems.

mismatchedUTS_Sndr.txt

This file contains the files with mismatches. A future (somewhat mindless) task is to go through these, look at the textfile, and try to see what's missing from the output.

I started a document for doing this here.

Key: NOTIMESTAMP means there is no timestamp, which is not a problem because that's just a reality of the OCR. SENDERCUTOFF is also not a problem, likewise. \nS or \nT or \nD are also okay, that just means that the From was empty (will usu match up with NOTIMESTAMP.