mozilla / readability

A standalone version of the readability lib

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Add `articleBody` to the metadata when found in the Article Schema Markup

tarekziade opened this issue · comments

A website like CNN provides the whole article body in its Article Schema Markup block in the articleBody

Readability could copy that value in _getArticleMetadata and return it in parse

Maybe it could also be leveraged to improve the parser output

Happy to do a patch :)

I should mention that articleBody is part of the standard https://schema.org/Article