Fix broken HTML before it is parsed
pboese opened this issue
Heya,
the site I'm crawling has some badly broken HTML that messes up parsing, so I need to manipulate the raw HTML before it is handed over to the parser. Is that possible, and if so, how?
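To illustrate the kind of workflow being asked about, here is a rough sketch using only the Python standard library. It does not use this project's API, since the issue does not say where such a hook would live. The function names `repair_html` and `extract_links`, the sample markup, and the specific defect (a missing closing quote on `href`) are all hypothetical; the real site's problems may need different fixes.

```python
import re
from html.parser import HTMLParser

# Hypothetical defect: the site forgets the closing quote on href values,
# e.g. <a href="/about>About</a>. The pattern only matches when the value
# runs straight into ">" without a closing quote.
_UNCLOSED_HREF = re.compile(r'href="([^"<>]*)>')


def repair_html(raw: str) -> str:
    """Patch known defects in the raw HTML string before parsing."""
    return _UNCLOSED_HREF.sub(r'href="\1">', raw)


class _LinkCollector(HTMLParser):
    """Collects the href attribute of every <a> start tag."""

    def __init__(self) -> None:
        super().__init__()
        self.links: list[str] = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value is not None:
                    self.links.append(value)


def extract_links(html: str) -> list[str]:
    """Parse (already repaired) HTML and return all link targets."""
    collector = _LinkCollector()
    collector.feed(html)
    collector.close()
    return collector.links


if __name__ == "__main__":
    raw = '<a href="/about>About</a> <a href="/contact">Contact</a>'
    fixed = repair_html(raw)  # repair first, parse second
    print(extract_links(fixed))  # ['/about', '/contact']
```

The point of the sketch is the ordering: the response body is treated as a plain string, targeted fixes are applied, and only then is it passed to the parser. Well-formed attributes (like the second link above) pass through unchanged.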
Thanks for any suggestions!
Sorry, this should have been a discussion.