grangier / python-goose

Html Content / Article Extractor, web scrapping lib in Python

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

h1,h2...h6 not returned

tamimibrahim opened this issue · comments

When I extracted articles from any page, I have noticed it don't return any heading "tag" like h1,h2...h6 value in cleaned_text.

Is that normal for everyone or I have missed anything?

This is a feature that would be indeed nice. I don't think it's part the current version of the project.