User data from Github https://github.com/jcseg
followers
following
stars
GitHub:@jcseg
Jcseg是基于mmseg算法的一个轻量级开源中文分词器,同时集成了关键字提取,关键短语提取,关键句子提取和文章自动摘要等功能,并且提供了最新版本的lucene, solr, elasticsearch的分词接口。
a Chinese tokenizer
准确率99.9%的ip到地名的映射库,0.0x毫秒级查询,数据库文件大小只有3.5M,提供了java, php, c查询绑定。妈妈再也不同担心我的ip地址定位!
High performance chinese tokenizer with both GBK and UTF-8 charset support developed by ANSI C
Converts text files to wav using Loquendo text-to-speech sdk