Simon Lee's repositories
ansj_fast_lda
LDA 的java实现
ansj_seg
ansj分词.ict的真正java实现.分词效果速度都超过开源版的ict. 中文分词,人名识别,词性标注,用户自定义词典
awesome-machine-learning-cn
机器学习资源大全中文版,包括机器学习领域的框架、库以及软件
carrot2
Carrot2: Text Clustering Algorithms and Applications
CNN_sentence
CNNs for sentence classification
codis
Proxy based Redis cluster solution supporting pipeline and scaling dynamically
Conjecture
Scalable Machine Learning in Scalding
dockviz
Visualizing docker data
epic
Epic is a high performance statistical parser written in Scala, along with a framework for building complex structured prediction models.
example-spark
Spark, Spark Streaming and Spark SQL unit testing strategies
flask
A microframework based on Werkzeug, Jinja2 and good intentions
flume
Mirror of Apache Flume
hazelcast-scala
Scala language support for Hazelcast
jQCloud
jQuery plugin for drawing neat word clouds that actually look like clouds
json4s
A single AST to be used by other scala json libraries
kafka-storm-starter
Code examples that show to integrate Apache Kafka 0.8+ with Apache Storm 0.9+ and Apache Spark Streaming 1.1+, while using Apache Avro as the data serialization format.
MachineLearning
This project contain some machine learning algrithm demo.Maybe the code is also useful to you.
onyx
Distributed, masterless, high performance, fault tolerant data processing
ppmessage
PPMessage - Plug and Play Online Customer Service, Customer Communication, Web Chat, Instant Message, iOS Android In-App Messaging SDK, Intercom Alternative, Implemented with pure Python
PredictionIO
PredictionIO, a machine learning server for developers and ML engineers. Built on Apache Spark, HBase and Spray.
scalaj-http
Simple scala wrapper for HttpURLConnection. OAuth included.
skynet
A lightweight online game framework
solr-scala-client
Solr Client for Scala
solrs
A solr client for scala, providing a query interface like SolrJ, just asynchronously / non-blocking
spark_study
spark源码学习
SparkOnHBase
SparkOnHBase
Sparta
Real Time Aggregation based on Spark Streaming