monorioa's starred repositories
Administrative-divisions-of-China
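Administrative divisions of the People's Republic of China: province level (provinces), prefecture level (cities), county level (districts and counties), township level (townships and subdistricts), and village level (village and residents' committees). Cascading two-, three-, four-, and five-level address data for provinces, cities, districts, towns, and villages.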
elasticsearch-sql
Use SQL to query Elasticsearch
flink-training-course
Flink video courses in Chinese (continuously updated...)
dubbo-admin
The ops and reference implementation for Apache Dubbo
word2vec-api
Simple web service providing a word embedding model
BERT-for-Sequence-Labeling-and-Text-Classification
Template code for using BERT for sequence labeling and text classification, intended to make it easier to apply BERT to more tasks. The template currently covers CoNLL-2003 named entity recognition and Snips slot filling and intent prediction.
pointer-generator
Code for the ACL 2017 paper "Get To The Point: Summarization with Pointer-Generator Networks" (Python3)
2019-CCF-BDCI-Car_sales
Champion solution for the passenger car segment sales forecasting task of the 2019 CCF Big Data & Computing Intelligence Contest
china-divisions
📍 Administrative division address library: SDK + crawler + data.
cnn-dailymail
Code to obtain the CNN / Daily Mail dataset (non-anonymized) for summarization (Python3)
fnc-1-baseline
A baseline implementation for FNC-1
LtpExtraction
A simple module for extracting opinions from reviews, based on LTP
ChineseAntiword
ChineseAntiword: an antonym lookup interface for Chinese words, based on a dictionary crawled from online resources
aided_writing
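A writing-assistance tool developed in C# and C++. It can build an autocomplete index from a large corpus and give real-time suggestions over corpora on the order of tens of millions of characters.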
cf_gbdt_lr
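A simple implementation of the recall and ranking models of a recommender system. The recall model uses collaborative filtering and the ranking model uses GBDT+LR.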
gmt-china.org
Homepage of the GMT Chinese community
KMeansCluster
A Java implementation of the k-means algorithm. It uses a ball tree as its internal data structure to accelerate computation, and the 2-norm distance to measure similarity between instances.
EntityResolution
Code implementation of entity resolution
Event_Extraction
A simple implementation of event extraction
TextSimilarity
A class containing commonly used text-similarity algorithms, such as longest common subsequence, shortest token distance, and TF-IDF
SparkStreamingElastic
Read data from Elasticsearch through Spark Streaming