ruippeixotog / scala-scraper

A Scala library for scraping content from HTML pages

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Waiting for real final rendering

brunierterry opened this issue · comments

@ruippeixotog
You did a great job with this scraper!

Is there a way to extract the content that a page would get asynchronously after the it finished the natural rendering?

Hi @brunierterry, thanks! For that I think you'd need to rely on HtmlUnit-specific features, of which I'm not 100% familiar. Looking at https://htmlunit.sourceforge.io/javascript-howto.html it seems that HtmlUnit has support for common event handlers, so maybe you can use onload?

Closing due to inactivity. Feel free to reopen if you still need support.