Yinghao-Li / ChemistryHTMLPaperParser

Convert HTML/XML Chemistry/Material Science articles into plain text

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Update extraction for Elsevier

Yinghao-Li opened this issue · comments

Extractor fails to extract content from https://www.sciencedirect.com/science/article/pii/S1572665719301237?via%3Dihub.

The issue likely appears at

if record_data or 'sec' in sec_id:

Adding condition and sec_id.startswith('s') might solve the issue.