milangritta / Geocoding-with-Map-Vector

Resources for the ACL 2018 publication "Which Melbourne? Augmenting Geocoding with Maps", published in July 2018.

Home Page: https://www.aclweb.org/anthology/P18-1119/

Generating eval data

rsuwaileh opened this issue

Hi,

Can you elaborate on the file format read by this line of code in the generate_evaluation_data(corpus, file_name) function in preprocessing.py?

doc = nlp(codecs.open(directory + str(line_no), u"r", encoding=u"utf-8").read())

Is it a separate file for every line/document?
I'm getting this error:

IOError: [Errno 2] No such file or directory: u'../data/california_data/0'

I do not currently have such a directory. The README gives no instructions beyond providing the (name,,name,,lat,,lon,,start,end) formatted file, which I already have under data/.

So I am not sure what is expected under this <corpus>/<line_no> directory?
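To make the question concrete, here is a minimal sketch of how I read that line. It assumes (this is my guess, not something the README confirms) that the directory should hold one plain UTF-8 text file per document, named 0, 1, 2, ... to match the line numbers of the annotation file. The function name missing_document_files and its parameters are hypothetical and not part of preprocessing.py.

```python
import os


def missing_document_files(annotation_path, directory):
    """Return the line numbers that have no matching document file.

    Assumption: line N of the annotation file (counting from 0) pairs
    with a text file at <directory>/<N>, as the failing path
    '../data/california_data/0' suggests.
    """
    missing = []
    with open(annotation_path, encoding="utf-8") as annotations:
        for line_no, _ in enumerate(annotations):
            document_path = os.path.join(directory, str(line_no))
            if not os.path.isfile(document_path):
                missing.append(line_no)
    return missing
```

If that reading is right, calling this on my annotation file and ../data/california_data/ would list every line number, since the directory does not exist at all on my side.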

Thanks!