Select random data while train and test?Is that suitable?

Question

Select random data while train and test?Is that suitable?

Zhangang1999 opened this issue 4 years ago · comments

Zhangang1999 commented 4 years ago

I am not sure about that.Because will it lead to overfitting?

chiachen-chang · Answer 1 · Thu Jun 29 2023 15:36:43 GMT+0800 (China Standard Time)

Yes, it's suitable to use the KDD dataset for unsupervised learning tasks. The training set includes only positive instances, while the testing set consists of both positive and negative ones. It's important to note that all the positive instances in the test set should be included in the training set, making this set-up appropriate for unsupervised learning.