Cannot parse the Musescore dataset
ta603 opened this issue
Hello. Thank you for the great work.
When I run parse_url() in parse_data.py, the call
parsed_html.find_all('article', attrs={'role':'article'})
returns an empty list:
>>> print(parsed_html.find_all('article', attrs={'role':'article'}))
[]
Would you tell me how to resolve this issue, please?
Hello, thank you for your interest. It seems that the Musescore website has changed its page structure. I have updated the parser script, and it now works again. Please let me know if you have further questions.
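For anyone hitting the same empty result, here is a minimal sketch of why the selector can stop matching after a site redesign. Both HTML strings are made up for illustration and are not the real Musescore markup; `count_articles` is a hypothetical helper that reuses the selector quoted in the issue.

```python
from bs4 import BeautifulSoup

# Hypothetical markup, for illustration only; not the real Musescore HTML.
OLD_STYLE = '<html><body><article role="article"><h2>Score A</h2></article></body></html>'
NEW_STYLE = '<html><body><div id="app"></div><script src="bundle.js"></script></body></html>'


def count_articles(html):
    """Count <article role="article"> elements, using the selector from the issue."""
    parsed_html = BeautifulSoup(html, "html.parser")
    return len(parsed_html.find_all("article", attrs={"role": "article"}))


print(count_articles(OLD_STYLE))  # selector matches the old-style markup
print(count_articles(NEW_STYLE))  # no matching element, so find_all() gives []
```

If the markup that reaches BeautifulSoup no longer contains the expected element, `find_all()` returns an empty list rather than raising an error, which matches the behaviour reported above.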
Note: The new crawler requires a working Selenium environment on your computer.
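As a rough sketch of what a Selenium-based fetch can look like (this is not the updated parser script itself, which is not shown in this thread), the snippet below loads a page in headless Chrome and hands the rendered HTML to BeautifulSoup. The names `fetch_rendered_html`, `extract_articles`, and the `wait_css` parameter are hypothetical, and the selector to wait for is an assumption that has to be adapted to the current page.

```python
from bs4 import BeautifulSoup


def fetch_rendered_html(url, wait_css=None, wait_seconds=10):
    """Load url in headless Chrome and return the HTML after scripts have run."""
    # Imported here so extract_articles() below works without selenium installed.
    from selenium import webdriver
    from selenium.webdriver.chrome.options import Options
    from selenium.webdriver.common.by import By
    from selenium.webdriver.support import expected_conditions as EC
    from selenium.webdriver.support.ui import WebDriverWait

    options = Options()
    options.add_argument("--headless")
    driver = webdriver.Chrome(options=options)
    try:
        driver.get(url)
        if wait_css is not None:
            # Assumption: the caller knows a CSS selector that appears once
            # the page has finished rendering; a timeout raises an exception.
            WebDriverWait(driver, wait_seconds).until(
                EC.presence_of_element_located((By.CSS_SELECTOR, wait_css))
            )
        return driver.page_source
    finally:
        # Always close the browser, even if loading or waiting fails.
        driver.quit()


def extract_articles(html):
    """Apply the selector from the issue to an HTML string."""
    parsed_html = BeautifulSoup(html, "html.parser")
    return parsed_html.find_all("article", attrs={"role": "article"})
```

Usage would be along the lines of `extract_articles(fetch_rendered_html(url))`. This needs the `selenium` package plus a Chrome browser and a matching driver available on the machine.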
Dear biboamy,
Thank you for the quick updates!