Generating eval data
rsuwaileh opened this issue · comments
Hi,
Can you elaborate on the file format read in this line of code in generate_evaluation_data(corpus, file_name)
function in the preprocessing.py
file?
doc = nlp(codecs.open(directory + str(line_no), u"r", encoding=u"utf-8").read())
Is it a separate file for every line/document?
I'm getting this error:
IOError: [Errno 2] No such file or directory: u'../data/california_data/0'
I currently do not have such a directory as no instructions in the README file except providing the (name,,name,,lat,,lon,,start,end)
formatted file which I have already under data/
So not sure what should be expected under this <corpus>/<line_no>
directory?
Thanks!