53seven / node-blindparser

blindparser is an all purpose RSS/ATOM feed parser that parses feeds into a common format so that you do not have to care if they are RSS or ATOM feeds

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

UTF8

jcar787 opened this issue · comments

Very nice parse. My question is when i have special characters like áéíóú etc, the parser return a ? instead of the character or utf8 code.
Any help is appreciated

Can you create a gist with some sample data so that I can have some inputs to test against? Thanks

Tell me if that is gonna help you. Thanks!

After trying to find the error, I think the issue is with the encoding of the source RSS feed itself. Other feeds that use UTF-8 are rendering properly.

Are you having issues with other feeds as well?

I was able to get proper output from these feeds:

http://arabic.cnn.com/rss/cnnarabic_topnews.rss
http://cnnespanol.cnn.com/feed

Its weird because in chrome the feed gives me special characters but not in blindparser. Another feeds return the special characters properly. Thanks.

Just as a head up, the file that you are trying to read is not UTF-8. The XML header states that the character encoding is iso-8859-1. My guess is that this is the source of your issues.