Cannot get the image, or even the text, from a Chinese page

Question

SheldonWang3000 opened this issue 9 years ago · comments

I wrote my code following the sample, but I cannot get the image or the text from other pages. Is this a bug, or do I need some other configuration?
I have already configured StopWordsChinese.
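
For reference, the setup described above might look roughly like the sketch below. It assumes the library in question is python-goose (imported as `goose`; the Python 3 fork uses `goose3` with the same names), which the thread does not state explicitly. The `report` helper is hypothetical and only summarizes whether any text or image came back, which makes it easier to compare pages that extract with pages that do not.

```python
def extract_chinese(url):
    """Fetch `url` and extract it using the Chinese stop-word list."""
    # Assumption: python-goose is installed and importable as `goose`.
    # With the Python 3 fork, import from `goose3` instead.
    from goose import Goose
    from goose.text import StopWordsChinese

    g = Goose({'stopwords_class': StopWordsChinese})
    return g.extract(url=url)


def report(article):
    """Hypothetical helper: summarize what the extractor found."""
    text = article.cleaned_text or ''
    image = article.top_image  # None when no image was selected
    return {
        'has_text': bool(text.strip()),
        'image_src': image.src if image is not None else None,
    }
```

Calling `report(extract_chinese(url))` on a page that works and on one that does not shows whether the failure affects the text, the image, or both.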

jijingg · Answer 1 · Sat Jun 13 2015 23:58:25 GMT+0800 (China Standard Time)

Please post the test URL.

Sheldon · Answer 2 · Sun Jun 14 2015 11:03:59 GMT+0800 (China Standard Time)

http://wz.sun0769.com/html/question/201506/278531.shtml
A page like this one.
However, I have found that some Chinese pages can be extracted while others cannot.