Skip to content

Generating eval data #2

@rsuwaileh

Description

@rsuwaileh

Hi,

Can you elaborate on the file format read in this line of code in generate_evaluation_data(corpus, file_name) function in the preprocessing.py file?

doc = nlp(codecs.open(directory + str(line_no), u"r", encoding=u"utf-8").read())

Is it a separate file for every line/document?
I'm getting this error:

IOError: [Errno 2] No such file or directory: u'../data/california_data/0'

I currently do not have such a directory as no instructions in the README file except providing the (name,,name,,lat,,lon,,start,end) formatted file which I have already under data/

So not sure what should be expected under this <corpus>/<line_no> directory?

Thanks!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions