willw's repositories
mango
A scalable genome browser. Apache 2 licensed.
adam
ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed.
delta
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
flink
Apache Flink
kafka
Mirror of Apache Kafka
biojava
:book::microscope::coffee: BioJava is an open-source project dedicated to providing a Java library for processing biological data.
deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
algorithms-sedgewick-wayne
Solutions to the exercises of the Algorithms book by Robert Sedgewick and Kevin Wayne
sparklens
Qubole Sparklens tool for performance tuning Apache Spark
Classification-Pyspark
This repository of classification template using pyspark.
diesel
A safe, extensible ORM and Query Builder for Rust
redis-rs
Redis library for rust
feature-engineering-for-ml-zh
:book: [译] 面向机器学习的特征工程
tispark
TiSpark is built for running Apache Spark on top of TiDB/TiKV
JavaGuide
【Java学习+面试指南】 一份涵盖大部分Java程序员所需要掌握的核心知识。
scala
The Scala programming language
scala-best-practices
A collection of Scala best practices
Optimus
:truck: Agile Data Science Workflows made easy with Python and Spark.
opentsdb
A scalable, distributed Time Series Database.
spark
Mirror of Apache Spark
tensorflow-without-a-phd
A crash course in six episodes for software developers who want to become machine learning practitioners.
DeepLearning-500-questions
深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为15个章节,近20万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系scutjy2015@163.com 版权所有,违权必究 Tan 2018.06
TensorFlowOnSpark
TensorFlowOnSpark brings TensorFlow programs onto Apache Spark clusters
OryxML
OryxML is a realization of the lambda architecture based on Oryx 2, using Apache Spark and Apache Kafka for real-time large scale machine learning.
kubernetes-handbook
Kubernetes中文指南/云原生应用架构实践手册 - https://jimmysong.io/kubernetes-handbook