biboamy / instrument-streaming

Cannot parse the Musescore dataset

ta603 opened this issue

commented

Hello. Thank you for the great work.
When executing parse_url() in parse_data.py,
parsed_html.find_all('article', attrs={'role':'article'}) gives nothing.
i.e.,

>>> print(parsed_html.find_all('article', attrs={'role':'article'}))
[]

Would you tell me how to resolve this issue, please?

Hello, thank you for your interest. It seems that the Musescore website has changed its page structure. I have updated the parser script, and it now works again. Please let me know if you have any further questions.

Note: The new crawler requires Selenium, so you have to set up a Selenium environment on your computer.
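
For anyone hitting the same empty result, here is a rough sketch of how one might check whether the `<article role="article">` elements are present in the fetched HTML at all. It is not the code from the updated parse_data.py. It uses the standard-library `html.parser` instead of BeautifulSoup so it runs without extra packages, the two markup strings are invented samples rather than real Musescore pages, and `fetch_rendered_html` is a hypothetical helper that assumes Selenium, Chrome, and a matching driver are installed.

```python
from html.parser import HTMLParser


class ArticleCounter(HTMLParser):
    """Counts <article role="article"> start tags in an HTML string."""

    def __init__(self):
        super().__init__()
        self.count = 0

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; tag names arrive lowercased.
        if tag == "article" and dict(attrs).get("role") == "article":
            self.count += 1


def count_articles(html):
    """Return how many <article role="article"> elements the HTML contains."""
    counter = ArticleCounter()
    counter.feed(html)
    counter.close()
    return counter.count


def fetch_rendered_html(url):
    """Hypothetical helper: load the page in a browser and return its HTML."""
    # Imported lazily so the counting code above runs without Selenium.
    from selenium import webdriver

    driver = webdriver.Chrome()  # assumes Chrome and its driver are available
    try:
        driver.get(url)
        return driver.page_source
    finally:
        driver.quit()


# Invented samples: one with the expected element, one without it.
old_markup = '<article role="article"><h2>Score</h2></article>'
new_markup = '<div id="app"></div><script src="bundle.js"></script>'

print(count_articles(old_markup))  # 1
print(count_articles(new_markup))  # 0
```

If the count is 0 for the HTML returned by a plain HTTP request but greater than 0 for the string returned by `fetch_rendered_html`, the elements are probably being added by JavaScript after the page loads. If it is 0 in both cases, the selector itself no longer matches the page and needs to be changed.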

commented

Dear biboamy,
Thank you for the quick updates!