sv24-archive / charade

NO LONGER MAINTAINED. USE chardet/chardet. Fork of chardet to support Python 2 and 3 in one code base.

Home Page:https://github.com/kennethreitz/requests/issues/951

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Charade 1.0.3 Badly identifies ISO-8859-15 as IBM855

HardbitCoded opened this issue · comments

Hello;
The above charade's version identifies the ISO-8859-15 as IBM855.

rui@rui-SatelliteI7:/Transferências$ file Bones.S09E09.HDTV.X264-LOL.srt
Bones.S09E09.HDTV.X264-LOL.srt: C++ source, ISO-8859 text, with CRLF line terminators
rui@rui-SatelliteI7:
/Transferências$ charade Bones.S09E09.HDTV.X264-LOL.srt
Bones.S09E09.HDTV.X264-LOL.srt: IBM855 with confidence 0.972957810694

Can you please check?

I don't know the code of charade well but would it be possible to specify an order for the detectors to run? That would solve the problem and might be useful for many use cases where the user has "a clue" of the encoding.