Elliot's repositories

cat

Central Application Tracking

Language:JavaScriptLicense:Apache-2.0Stargazers:1Issues:0Issues:0

spark-ml-source-analysis

spark ml 算法原理剖析以及具体的源码实现分析

License:Apache-2.0Stargazers:1Issues:0Issues:0

xgboost

Large-scale and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, on single node, hadoop yarn and more.

Language:C++License:NOASSERTIONStargazers:1Issues:0Issues:0

angel

A Flexible and Powerful Parameter Server for large-scale machine learning

Language:JavaLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

ansj_seg

ansj分词.ict的真正java实现.分词效果速度都超过开源版的ict. 中文分词,人名识别,词性标注,用户自定义词典

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

camus

LinkedIn's Kafka to HDFS pipeline.

Language:JavaStargazers:0Issues:0Issues:0

canal

阿里巴巴mysql数据库binlog的增量订阅&消费组件

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

cws_evaluation

Java开源项目cws_evaluation:中文分词器分词效果评估对比

Language:LexLicense:Apache-2.0Stargazers:0Issues:0Issues:0

deeplearningbook-chinese

Deep Learning Book Chinese Translation

Language:TeXStargazers:0Issues:0Issues:0

disconf

Distributed Configuration Management Platform(分布式配置管理平台)

Language:JavaLicense:GPL-2.0Stargazers:0Issues:0Issues:0

elasticsearch

Open Source, Distributed, RESTful Search Engine

Language:JavaLicense:Apache-2.0Stargazers:0Issues:2Issues:0

faiss

A library for efficient similarity search and clustering of dense vectors.

Language:C++License:NOASSERTIONStargazers:0Issues:0Issues:0

Familia

A Toolkit for Chinese Topic Modeling

Language:C++License:BSD-3-ClauseStargazers:0Issues:0Issues:0

FM_FTRL

Hashed Factorization Machine with Follow The Regularized Leader for Kaggle Avazu Click-Through Rate Competition

Language:PythonStargazers:0Issues:0Issues:0

fnlp

中文自然语言处理工具包 Toolkit for Chinese natural language processing

Language:JavaLicense:LGPL-3.0Stargazers:0Issues:0Issues:0

gobblin

Universal data ingestion framework for Hadoop.

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

HanLP

汉语言处理包 中文分词 词性标注 命名实体识别 依存句法分析 关键词提取 自动摘要 短语提取 拼音 简繁转换

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

incubator-airflow

Apache Airflow (Incubating)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

jieba

结巴中文分词

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

jstorm

Java Storm

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

kafka-manager

A tool for managing Apache Kafka.

Language:ScalaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

KafkaOffsetMonitor

A little app to monitor the progress of kafka consumers and their lag wrt the queue.

Language:ScalaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

learning-spark

Example code from Learning Spark book

Language:JavaLicense:MITStargazers:0Issues:0Issues:0
Language:CLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

liblinear-java

Java version of LIBLINEAR

Language:JavaLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

ltp

Language Technology Platform

Language:C++License:NOASSERTIONStargazers:0Issues:0Issues:0

Online-Random-Bit-Regression-FTRL

Online Random Bit Regression with FTRL-Proximal in Python

Language:PythonStargazers:0Issues:0Issues:0

scikit-learn

scikit-learn: machine learning in Python

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

snownlp

Python library for processing Chinese text

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

ssdb

SSDB - A fast NoSQL database, an alternative to Redis

Language:C++License:BSD-3-ClauseStargazers:0Issues:0Issues:0