Cannot parse the Musescore dataset

Question

Cannot parse the Musescore dataset

ta603 opened this issue 4 years ago · comments

Hello. Thank you for the great work.
When executing parse_url() in parse_data.py,
parsed_html.find_all('article', attrs={'role':'article'}) gives nothing.
i.e.,

>>>print(parsed_html.find_all('article', attrs={'role':'article'}))
[]

Would you tell me how to resolve this issue, please?

biboamy · Answer 1 · Tue Feb 25 2020 00:26:44 GMT+0800 (China Standard Time)

Hello, thank you for your interest. It seems that the Musescore website has changed its website structure. I have updated the parser script and now it works again. Please let me know if you have further questions.

Note: For the new crawler, you have to set up the selenium environment on your computer.

ta603 · Answer 2 · Sun Mar 01 2020 00:25:39 GMT+0800 (China Standard Time)

Dear biboamy,
Thank you for the quick updates!