Add Detectors and Probers for target languages

rstm-sf opened this issue 5 years ago

Rustam commented 5 years ago

Hello!

Would it be worth adding the ability to detect the encoding when the target language is already known?

Julian Verdurmen · Answer 1 · Sat Aug 24 2019 20:25:25 GMT+0800 (China Standard Time)

Hi,

Sorry for the late response.

What do you mean by this?

Rustam · Answer 2 · Sun Sep 01 2019 03:54:39 GMT+0800 (China Standard Time)

Hello!

I created PR #63 to make this easier to understand.

To detect the encoding, prober objects are created, and they are defined for multiple languages. With only a small sample of characters, conflicts can arise between encodings, because the same byte value can be a valid character code in several languages.

But what if we need to determine the encoding and know that the text can belong to only one language? Then we could restrict ourselves to the probers for that language, reducing the likelihood of an incorrect detection.
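To illustrate the idea, here is a rough Python sketch. It is not the library's API and not the code from the PR: the `Prober` class, the `PROBERS` table, and the `candidates` function are hypothetical names, and "the bytes decode without error" is only a crude stand-in for the statistical confidence a real prober would compute. It only shows how a short sample stays ambiguous until the prober set is filtered by language.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Prober:
    """Hypothetical stand-in for a single-byte prober."""
    encoding: str  # Python codec name
    language: str  # language this prober's model targets (assumed tag)


# Assumed, illustrative table; a real detector has many more entries.
PROBERS = [
    Prober("cp1251", "russian"),
    Prober("koi8_r", "russian"),
    Prober("cp1253", "greek"),
    Prober("iso8859_7", "greek"),
]


def candidates(data: bytes, language: Optional[str] = None) -> List[str]:
    """Return encodings that accept `data`, optionally limited to one language."""
    selected = [p for p in PROBERS if language is None or p.language == language]
    result = []
    for prober in selected:
        try:
            # Crude check: a real prober would score character frequencies.
            data.decode(prober.encoding)
        except UnicodeDecodeError:
            continue
        result.append(prober.encoding)
    return result


sample = "Привет".encode("cp1251")
print(candidates(sample))                      # Greek code pages also accept these bytes
print(candidates(sample, language="russian"))  # only the Russian probers are tried
```

With the language given, the Greek probers are never consulted, so they cannot win on a short sample by accident.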

P.S. Sorry for my English.

Julian Verdurmen · Answer 3 · Sun Sep 22 2019 06:07:41 GMT+0800 (China Standard Time)

Sounds good, but I'm not sure how easy it is to change that in this code base.

Rustam · Answer 4 · Sat Nov 09 2019 19:43:56 GMT+0800 (China Standard Time)

It seems to me that we should first try to single out the single-byte probers by language, as models.

Rustam · Answer 5 · Mon Feb 24 2020 15:53:28 GMT+0800 (China Standard Time)

Hello, @304NotModified!

Can we make breaking changes and mark everything in src/Core as internal? This would make it easier to change the code.

Julian Verdurmen · Answer 6 · Tue Feb 25 2020 00:52:11 GMT+0800 (China Standard Time)

Do you mean whether making breaking changes in src/core is OK? I think it is. We should also make them internal.

Rustam · Answer 7 · Tue Feb 25 2020 01:18:18 GMT+0800 (China Standard Time)

I think it would be nice if we could just change the source in src/core without thinking about breaking changes. That is, change the modifier from public to internal.

I have the idea of separating the probers, as models, by language (although it will take a lot of time, since there are about 100 of them). It would then also be nice to change the namespace.