leochencipher's repositories
spark-jobserver
REST job server for Spark
hadoop-ansible
Ansible playbook that installs a Hadoop cluster, with HBase, Hive, Presto for analytics, and Ganglia, Smokeping, Fluentd, Elasticsearch and Kibana for monitoring and centralized log indexing.
SparkInternals
Notes talking about the design and implementation of Apache Spark
ansible_ui
ansible web ui, more simple and better layout
ConfigFiles
My Config Files
fluxcapacitor
Flux Capacitor is a Java-based, cloud-native, reference architecture demonstrating many of the Netflix Open Source projects. *** Note: If you're looking for the Spark-based, Big Data Pipeline project, it's here: https://github.com/fluxcapacitor ***
filecrush
Remedy small files by combining them into larger ones.
mdrill
for千亿数据即席分析
vagrant-mesos
Mesos/Docker/Marathon/Aurora development Vagrant Virtualbox setup
NetEase-MusicBox
网易云音乐命令行版本,排行榜,搜索,精选歌单,登录,DJ节目,快速打碟,本地收藏歌单
kafka-web-console
A web console for Apache Kafka
pifs
πfs - the data-free filesystem!
skyline
It'll detect your anomalies! Part of the Kale stack.
youckan
Django-based SSO
bypy
Python client for Baidu Yun (Personal Cloud Storage) 百度云/百度网盘Python客户端
HBase-ToHDFS
Reads a HBase table and writes the out as Text, Seq, Avro, or Parquet
docker-perf
Performance monitoring for applications in docker containers
Word2Vec
Word2Vec - Google's word2vec in Scala using UMASS factorie library for better hacking and research.
router
Frontend Router for Gilliam
csc6870-river
CSC 6870 River simulation using palabos
sseredis
Redis PubSub to Server-Sent Event bridge in Go
dubbo
Dubbo is a distributed service framework enpowers applications with service import/export capability with high performance RPC.
WeedFSClient
Java client for WeedFS
hydra
分布式跟踪系统
twitterdig
Dig twitter hot words
metricsd
A metrics aggregator for Graphite
flockdb
A distributed, fault-tolerant graph database
marchingCubeVisualizer
marchingCubeVisualizer: VTK meshes visualizer