monorioa

monorioa

Geek Repo

0

followers

0

following

Location:Beijing

Github PK Tool:Github PK Tool

monorioa's starred repositories

spark

Apache Spark - A unified analytics engine for large-scale data processing

Language:ScalaLicense:Apache-2.0Stargazers:39381Issues:2022Issues:0

bert

TensorFlow code and pre-trained models for BERT

Language:PythonLicense:Apache-2.0Stargazers:37921Issues:999Issues:1143

Administrative-divisions-of-China

中华人民共和国行政区划:省级(省份)、 地级(城市)、 县级(区县)、 乡级(乡镇街道)、 村级(村委会居委会) ,**省市区镇村二级三级四级五级联动地址数据。

Language:JavaScriptLicense:WTFPLStargazers:18436Issues:387Issues:128

elasticsearch-sql

Use SQL to query Elasticsearch

Language:JavaLicense:Apache-2.0Stargazers:6991Issues:457Issues:986

flink-training-course

Flink 中文视频课程(持续更新...)

dubbo-admin

The ops and reference implementation for Apache Dubbo

Language:JavaLicense:Apache-2.0Stargazers:3988Issues:230Issues:733

RecSys

计算广告/推荐系统/机器学习(Machine Learning)/点击率(CTR)/转化率(CVR)预估/点击率预估

word2vec-api

Simple web service providing a word embedding model

deepnlp

Deep Learning NLP Pipeline implemented on Tensorflow

Language:PythonLicense:MITStargazers:1345Issues:134Issues:58

ner

命名实体识别实践与探索

BERT-for-Sequence-Labeling-and-Text-Classification

This is the template code to use BERT for sequence lableing and text classification, in order to facilitate BERT for more tasks. Currently, the template code has included conll-2003 named entity identification, Snips Slot Filling and Intent Prediction.

Language:PythonLicense:Apache-2.0Stargazers:467Issues:10Issues:15

pointer-generator

Code for the ACL 2017 paper "Get To The Point: Summarization with Pointer-Generator Networks" (Python3)

Language:PythonLicense:NOASSERTIONStargazers:311Issues:9Issues:0

2019-CCF-BDCI-Car_sales

2019年CCF大数据与计算智能大赛乘用车细分市场销量预测冠军解决方案

china-divisions

📍**行政区划地址库 SDK + 爬虫 + 数据。

Language:PHPLicense:MITStargazers:201Issues:7Issues:6

fnc-1

Fake News Challenge

Language:PythonLicense:Apache-2.0Stargazers:178Issues:14Issues:5

cnn-dailymail

Code to obtain the CNN / Daily Mail dataset (non-anonymized) for summarization (Python3)

Language:PythonStargazers:143Issues:2Issues:0

fnc-1-baseline

A baseline implementation for FNC-1

Language:PythonLicense:Apache-2.0Stargazers:137Issues:20Issues:1

LtpExtraction

基于ltp的简单评论观点抽取模块

Language:Jupyter NotebookStargazers:116Issues:2Issues:2

ChineseAntiword

chinese anti semantic word search interface based on dict crawled from online resources, ChineseAntiword,针对中文词语的反义词查询接口

Language:PythonStargazers:58Issues:3Issues:0

aided_writing

基于C#和C++开发的辅助写作工具。可基于大规模语料库构建自动补全索引,实现千万字次级的语料的实时提示

Language:C#License:AGPL-3.0Stargazers:55Issues:2Issues:2

cf_gbdt_lr

简单的实现推荐系统的召回模型和排序模型,其中召回模型使用协同过滤算法,排序模型使用gbdt+lr算法

Language:PythonStargazers:55Issues:0Issues:0

-TOP1-

CCF大数据与计算智能大赛-工件检测TOP1方案

Language:Jupyter NotebookStargazers:27Issues:1Issues:2

gmt-china.org

GMT 中文社区主页

crees

Crisis Event Extraction Service (CREES)

Language:PythonLicense:Apache-2.0Stargazers:16Issues:3Issues:0

KMeansCluster

A java implementation of k-means algorithm.It uses ball tree as internal data structure to accelerate the computation.It uses 2-norm distance to compute the similarity between instances.

Language:JavaStargazers:11Issues:1Issues:0

EntityResolution

实体统一的代码实现

Language:Jupyter NotebookStargazers:8Issues:2Issues:0

Event_Extraction

A simple implement of event extraction

Language:PythonStargazers:4Issues:0Issues:0

TextSimilarity

这是一个类,里面包含的有关文本相似度的常用的计算算法,例如,最长公共子序列,最短标记距离,TF-IDF等算法

Language:PythonStargazers:4Issues:0Issues:0

SparkStreamingElastic

Read the data in elasticsearch through sparkstreaming

Language:JavaStargazers:1Issues:0Issues:0