XingyuXiong / 2021CIDhw3

2021CIDhw3

KDDTest.csv is the test data (Set1, Set2, and Set3 are temporary files). 20 Percent Training Set.csv is the training data (used for K-NN).

This repository contains the dataset and my Python scripts. Run python preprocess.py to process the data (training or test data, chosen via parameters).

test.py is a script still in development. It tries to calculate the nearest distance for all 20,000+ rows in the test dataset using the test function, which is really slow.
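
One possible way to speed up the nearest-distance step is to replace a row-by-row loop with vectorized NumPy operations. The sketch below is not the code in test.py; it assumes the preprocessed training and test data are already numeric arrays with the same columns, and it uses Euclidean distance. The function name nearest_distances and the chunk_size parameter are hypothetical, introduced only for illustration.

```python
import numpy as np


def nearest_distances(test_X, train_X, chunk_size=1024):
    """For each test row, return the Euclidean distance to its nearest training row.

    Hypothetical helper: assumes both inputs are numeric 2-D arrays
    with the same number of columns.
    """
    test_X = np.asarray(test_X, dtype=float)
    train_X = np.asarray(train_X, dtype=float)

    # Squared norms of the training rows, computed once and reused.
    train_sq = np.sum(train_X ** 2, axis=1)
    out = np.empty(test_X.shape[0])

    # Process the test rows in chunks so the distance matrix stays small.
    for start in range(0, test_X.shape[0], chunk_size):
        chunk = test_X[start:start + chunk_size]
        chunk_sq = np.sum(chunk ** 2, axis=1)[:, None]
        # ||a - b||^2 = ||a||^2 - 2 a.b + ||b||^2
        d2 = chunk_sq - 2.0 * (chunk @ train_X.T) + train_sq[None, :]
        np.maximum(d2, 0.0, out=d2)  # clip tiny negatives caused by rounding
        out[start:start + chunk_size] = np.sqrt(d2.min(axis=1))
    return out


if __name__ == "__main__":
    train = [[0.0, 0.0], [3.0, 4.0]]
    test = [[0.0, 1.0], [3.0, 3.0]]
    print(nearest_distances(test, train))
```

Chunking keeps memory bounded: with 20,000+ test rows, a full test-by-training distance matrix could be large, while a chunk of about a thousand rows at a time is usually manageable.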
