bzz / scholar-alert-digest

Aggregate unread emails from Google Scholar alerts

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Full duplicates in the output

jzuken opened this issue · comments

Occasionally there are duplicates in the tool output:

Screenshot 2019-12-06 at 11 17 38

Is it a bug or a feature? :)

commented

Thank you for reporting!

Is it a bug or a feature? :)

From the first glance, that seem like a bug to me :suspect:
If you could attach a gist with the full Markdown report - I'll be happy to take a deeper look asap.

Sure: https://gist.github.com/jzuken/3223bc3fe0bb2abcf0a70431a100f636

They seem to have different URLs, but exactly the same title. AFAIK, Scholar groups papers with matching substrings in titles (e.g. this paper has 12 versions). This might be too much, just grouping the exact matches would be awesome.

commented

Was closed by the automation on merging #16.
@jzuken would you be so kind and test the latest master to see if it fixes the issue for you?
Otherwise, please, feel free to re-open it. Thanks!

Looks good now, thank you!