qinwf / jiebaR

Chinese text segmentation with R. R语言中文分词 (文档已更新 🎉 :https://qinwenfeng.com/jiebaR/ )

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

run worker() function 出现R session aborted 是怎么回事?

liaoshuai1 opened this issue · comments

这是代码
rm(list=ls())
###分词
#install.packages('Rcpp')
#install.packages('jiebaR')
library(Rcpp)
library(jiebaRD)
library(jiebaR)

en = "R is my favorite programming language."
cn = "R语言是我最喜爱的编程语言"

wk=jiebaR::worker() #worker()函数是用来初始化分词引擎的,默认为混合模型。
segment(en,wk)

segment(cn,wk)

##worker其中type是算法类型,dict是字典来源,如果不设置就会使用jiebaR默认的字典,同样的hmm路径和user路径也一样。symbol选项是F表示不保留标点。此外,还可以设置停词字典,更多详细具体的设置可以在R语言里用?worker()查看。
cc2 = jiebaR::worker(type = "mix", dict = "dict/jieba.dict.utf8",
hmm = "dict/hmm_model.utf8",
user = "dict/test.dict.utf8",
symbol = F
)
###https://zhuanlan.zhihu.com/p/24882048

报错:2022-01-12 00:21:33 ../inst/include/lib/DictTrie.hpp:159 FATAL exp: [ifs.is_open()] false. open dict/jieba.dict.utf8 failed.