matthewmueller / x-ray

The next web scraper. See through the <html> noise.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

filter only selected words?

waptik opened this issue · comments

Subject of the issue

I'm crawling a webpage that has unwanted contents in my scope.
I want to select tag that do not contain certain words in their content.
jsoup has something like :contains() selector.
can someone help me?

sample html can be found in this gist