CharsetDetector / UTF-unknown

Character set detector built in C# - .NET 5+, .NET Core 2+, .NET Standard 1+ & .NET 4+

ISO/IEC 8859-11 is a nearly identical version of TIS-620

rstm-sf opened this issue

Hello!

A nearly identical version of TIS-620 has been adopted as ISO/IEC 8859-11 in 2001, the sole difference being that ISO/IEC 8859-11 defines hex A0 as a non-breaking space, while TIS-620 leaves it undefined but reserved. (In practice, this small distinction is usually ignored.)
https://en.wikipedia.org/wiki/Thai_Industrial_Standard_620-2533#Variants

/// <summary>
/// TIS-620 codepage name.
/// </summary>
/// <remarks>
/// TODO: Equal to iso-8859-11?
/// </remarks>
internal const string TIS_620 = "tis-620";
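
To make the difference concrete, here is a minimal sketch of a single-byte decoder for the two charsets. `ThaiCharsets` and `Decode` are hypothetical names for illustration only and are not part of UTF-unknown. The sketch covers only graphic characters, so bytes 0x80-0x9F are treated as undefined in both cases, which is a simplification.

```csharp
using System;

// Hypothetical helper for illustration; not part of UTF-unknown.
public static class ThaiCharsets
{
    // Decodes one byte. Returns null where the charset defines no
    // graphic character for that byte.
    public static char? Decode(byte b, bool iso885911)
    {
        // ASCII range is shared by both charsets.
        if (b < 0x80) return (char)b;

        // The sole difference: ISO/IEC 8859-11 defines 0xA0 as NO-BREAK SPACE,
        // while TIS-620 leaves it undefined but reserved.
        if (b == 0xA0) return iso885911 ? '\u00A0' : (char?)null;

        // Thai letters and signs: 0xA1-0xDA map to U+0E01-U+0E3A.
        if (b >= 0xA1 && b <= 0xDA) return (char)(0x0E01 + (b - 0xA1));

        // Baht sign, vowels, digits, punctuation: 0xDF-0xFB map to U+0E3F-U+0E5B.
        if (b >= 0xDF && b <= 0xFB) return (char)(0x0E3F + (b - 0xDF));

        // Simplification: 0x80-0x9F, 0xDB-0xDE and 0xFC-0xFF are treated as undefined.
        return null;
    }
}
```

With this sketch, `ThaiCharsets.Decode(0xA0, true)` yields U+00A0 and `ThaiCharsets.Decode(0xA0, false)` yields `null`, while every other byte decodes identically in both modes. That would suggest the detector could report a single name for both unless 0xA0 appears in the input.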

P.S. Thanks to @meshy for the info in PyYoshi/cChardet#13 (comment)

This is now part of #78.