Bayesian classifier can't distinguish "SQL" vs "PLpgSQL"
bzz opened this issue · comments
Alex commented
Part of #155.
After updating to the latest samples in #189, the Bayesian classifier test fails to distinguish "SQL" from "PLpgSQL" based only on content. The classifier weights differ between enry and linguist for the same document: #189 (comment)
This most probably has to do with the difference in tokenization between the two projects, which is going to be addressed in #193.
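To illustrate why tokenization alone can shift the weights, here is a minimal sketch of a naive Bayes scorer with add-one smoothing, run with two different tokenizers over the same toy training samples. Everything in it is an assumption made for illustration: the two tokenizers, the `SAMPLES` data, and the function names are hypothetical and are not the tokenizers or the classifier code from enry or linguist.

```python
import math
import re
from collections import Counter

# Toy training data, invented for this sketch (not the real linguist samples).
SAMPLES = {
    "SQL": ["SELECT id FROM users WHERE id = 1;"],
    "PLpgSQL": [
        "CREATE FUNCTION f() RETURNS int AS $$ BEGIN RETURN 1; END; $$ LANGUAGE plpgsql;"
    ],
}


def tokenize_words(text):
    # Hypothetical tokenizer A: keeps only runs of letters and underscores.
    return re.findall(r"[A-Za-z_]+", text)


def tokenize_words_and_symbols(text):
    # Hypothetical tokenizer B: same words, plus each punctuation character
    # emitted as its own token.
    return re.findall(r"[A-Za-z_]+|[^\sA-Za-z_0-9]", text)


def train(samples, tokenize):
    # Build per-language token frequency tables.
    return {
        lang: Counter(tok for text in texts for tok in tokenize(text))
        for lang, texts in samples.items()
    }


def log_score(tokens, counts, vocab_size):
    # Sum of log P(token | language) with add-one (Laplace) smoothing.
    total = sum(counts.values())
    return sum(math.log((counts[tok] + 1) / (total + vocab_size)) for tok in tokens)


def classify(text, samples, tokenize):
    # Returns the best-scoring language and the per-language log scores.
    model = train(samples, tokenize)
    vocab = set().union(*model.values())
    tokens = tokenize(text)
    scores = {
        lang: log_score(tokens, counts, len(vocab)) for lang, counts in model.items()
    }
    return max(scores, key=scores.get), scores


if __name__ == "__main__":
    doc = "SELECT name FROM users;"
    for tokenize in (tokenize_words, tokenize_words_and_symbols):
        label, scores = classify(doc, SAMPLES, tokenize)
        print(tokenize.__name__, label, scores)
```

The same document and the same samples yield different per-language scores depending only on which tokenizer is used, since both the token counts and the vocabulary size change. With closely related languages such as "SQL" and "PLpgSQL", a shift like that could plausibly be enough to flip the result.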