logpai / loghub

A large collection of system log datasets for AI-driven log analytics [ISSRE'23]

HDFS log entry count differs between Loghub and the DeepLog paper

EricWebsmith opened this issue

In Loghub, HDFS_1 has 11,175,629 entries.
In the DeepLog paper, the number is 11,197,954 logs.
I am confused.

Thanks.

Sorry for the late reply. Our data did not come from the DeepLog paper. We use the same dataset as in our ISSRE paper.

Hi, it would be really great if you could update the README for your HDFS logs.
It looks like HDFS_1 is the same dataset as the one used in "Detecting Large-Scale System Problems by Mining Console Logs"; however, the total number of lines and anomalies is not the same. Therefore, all my work that uses HDFS_1 and references this publication is wrong :/
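
For anyone who wants to check which of the two figures a local copy matches, a rough sketch in Python follows. It only counts lines and compares the result with the two numbers quoted in this thread. The file name `HDFS.log` and the helper names `count_lines` and `compare` are assumptions for illustration, not something defined by Loghub.

```python
from pathlib import Path

# Entry counts quoted in this thread.
LOGHUB_COUNT = 11_175_629   # HDFS_1 as reported for Loghub
DEEPLOG_COUNT = 11_197_954  # HDFS as reported for the DeepLog paper


def count_lines(path, chunk_size=1 << 20):
    """Count lines in a file by reading it in binary chunks."""
    total = 0
    last_byte = b""
    with open(path, "rb") as f:
        while chunk := f.read(chunk_size):
            total += chunk.count(b"\n")
            last_byte = chunk[-1:]
    # A final line without a trailing newline still counts as a line.
    if last_byte and last_byte != b"\n":
        total += 1
    return total


def compare(path):
    """Return the line count and its offset from both quoted figures."""
    n = count_lines(path)
    return {
        "lines": n,
        "diff_vs_loghub": n - LOGHUB_COUNT,
        "diff_vs_deeplog": n - DEEPLOG_COUNT,
    }


if __name__ == "__main__":
    log_path = Path("HDFS.log")  # hypothetical local path, adjust as needed
    if log_path.exists():
        print(compare(log_path))
    # Gap between the two quoted figures.
    print(DEEPLOG_COUNT - LOGHUB_COUNT)
```

The gap between the two quoted figures is 22,325 entries. A line count alone does not explain where that gap comes from; it only shows which figure a given copy of the file agrees with.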