Select random data while train and test?Is that suitable?
Zhangang1999 opened this issue · comments
Zhangang1999 commented
I am not sure about that.Because will it lead to overfitting?
chiachen-chang commented
Yes, it's suitable to use the KDD dataset for unsupervised learning tasks. The training set includes only positive instances, while the testing set consists of both positive and negative ones. It's important to note that all the positive instances in the test set should be included in the training set, making this set-up appropriate for unsupervised learning.