Square brackets interfere with the segmentation of user-defined words that end in an English letter
ldwnt opened this issue
Version: ansj_seg-5.1.3.jar
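import java.util.ArrayList;
import java.util.List;
import org.ansj.domain.Term;
import org.ansj.library.DicLibrary;
import org.ansj.splitWord.Analysis;
import org.ansj.splitWord.analysis.DicAnalysis;
import org.nlpcn.commons.lang.tire.domain.Forest;
import org.nlpcn.commons.lang.tire.domain.Value;
import org.nlpcn.commons.lang.tire.library.Library;

public class Test {
    public static void main(String[] sts) {
        List<Value> values = new ArrayList<>();
        values.add(new Value("农银货币B", new String[]{"n", "1000"}));
        Forest forest = Library.makeForest(values);
        Analysis analysis = new DicAnalysis().setForests(forest, DicLibrary.get());
        List<Term> terms = analysis.parseStr("农银货币B 农银货币B[]").getTerms();
        System.out.println(terms);
    }
}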
Output:
[农银货币B/n, , 农/ng, 银/ng, 货币/n, b/en]
Using org.nlpcn.commons.lang.util.WordAlert#isEnglish(char) and org.nlpcn.commons.lang.util.WordAlert#isNumber(char) as the reference, modify:
org.nlpcn.commons.lang.tire.SmartGetWord#isE
org.nlpcn.commons.lang.tire.SmartGetWord#isNum
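A minimal sketch of the kind of character checks the suggestion above points toward. It is not the actual nlp-lang source. The class name `CharClassSketch` is hypothetical, and the exact ranges (ASCII upper and lower case plus the full-width forms) are an assumption about what "behaving like WordAlert" would mean. Whether this alone resolves the bracket case has not been verified here.

```java
// Hypothetical stand-alone helper; NOT the real SmartGetWord or WordAlert code.
final class CharClassSketch {

    // Assumed rule: treat ASCII and full-width Latin letters, in both cases, as English.
    static boolean isEnglish(char c) {
        return (c >= 'a' && c <= 'z')
            || (c >= 'A' && c <= 'Z')
            || (c >= '\uFF41' && c <= '\uFF5A')   // full-width a to z
            || (c >= '\uFF21' && c <= '\uFF3A');  // full-width A to Z
    }

    // Assumed rule: treat ASCII and full-width digits as numbers.
    static boolean isNumber(char c) {
        return (c >= '0' && c <= '9')
            || (c >= '\uFF10' && c <= '\uFF19');  // full-width 0 to 9
    }

    public static void main(String[] args) {
        // The custom word in the report ends in an upper-case 'B',
        // and the failing input follows it with '['.
        System.out.println("isEnglish('B') = " + isEnglish('B'));
        System.out.println("isEnglish('[') = " + isEnglish('['));
        System.out.println("isNumber('7')  = " + isNumber('7'));
    }
}
```

If isE and isNum are changed along these lines, rerunning the Test class from the report would show whether the second 农银货币B is then kept as a single term.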
Thanks, I'll give it a try~