EmilHvitfeldt / textdata

Download, parse, store, and load text datasets instead of storing them in packages

Home Page: https://emilhvitfeldt.github.io/textdata/

lexicon_nrc() broken due to structural changes in source ZIP archive

grantdick opened this issue

lexicon_nrc() fails to run because of a missing file.

The structure of the ZIP archive downloaded from http://saifmohammad.com/WebDocs/Lexicons/NRC-Emotion-Lexicon.zip appears to have changed
(according to http://saifmohammad.com/WebPages/NRC-Emotion-Lexicon.htm, this file was updated in August 2022).

The path currently specified in process_nrc() is:
"NRC-Emotion-Lexicon/NRC-Emotion-Lexicon-v0.92/NRC-Emotion-Lexicon-Wordlevel-v0.92.txt"
but it seems it should be:
"NRC-Emotion-Lexicon/NRC-Emotion-Lexicon-Wordlevel-v0.92.txt"
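For illustration only, here is a rough sketch of one way a hard-coded path like this could be made less sensitive to folder changes: look the file up by its name among the archive entries rather than by its full path. The helper find_archive_entry() is hypothetical and is not part of textdata; this is not necessarily how the package handles it. The two example vectors simply repeat the old and new paths quoted above.

```r
# Hypothetical helper (not part of textdata): find an archive entry by its
# file name alone, so an added or removed parent folder does not matter.
find_archive_entry <- function(entries, file_name) {
  hits <- entries[basename(entries) == file_name]
  if (length(hits) != 1L) {
    stop("Expected exactly one entry named '", file_name,
         "', found ", length(hits))
  }
  hits
}

# The two archive layouts described in this issue.
old_layout <- c(
  "NRC-Emotion-Lexicon/NRC-Emotion-Lexicon-v0.92/NRC-Emotion-Lexicon-Wordlevel-v0.92.txt"
)
new_layout <- c(
  "NRC-Emotion-Lexicon/NRC-Emotion-Lexicon-Wordlevel-v0.92.txt"
)
target <- "NRC-Emotion-Lexicon-Wordlevel-v0.92.txt"

# The same lookup resolves the file in either layout.
old_hit <- find_archive_entry(old_layout, target)
new_hit <- find_archive_entry(new_layout, target)
```

With a real download, the entries vector could presumably be taken from unzip(zip_path, list = TRUE)$Name, where zip_path is a placeholder for wherever the archive was saved.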

Thank you for reporting! This has been fixed in d41c432