willw (top1select)

willw's repositories

mango

A scalable genome browser. Apache 2 licensed.

License: Apache-2.0, Stargazers: 0, Issues: 0

adam

ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed.

License: Apache-2.0, Stargazers: 0, Issues: 0

delta

An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.

License: Apache-2.0, Stargazers: 0, Issues: 0

flink

Apache Flink

License: Apache-2.0, Stargazers: 0, Issues: 0

kafka

Mirror of Apache Kafka

License: Apache-2.0, Stargazers: 0, Issues: 0

biojava

BioJava is an open-source project dedicated to providing a Java library for processing biological data.

License: LGPL-2.1, Stargazers: 0, Issues: 0

deequ

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

Language: Scala, License: Apache-2.0, Stargazers: 0, Issues: 0

algorithms-sedgewick-wayne

Solutions to the exercises of the Algorithms book by Robert Sedgewick and Kevin Wayne

Language: Java, License: MIT, Stargazers: 0, Issues: 0
Language: Scala, License: Apache-2.0, Stargazers: 0, Issues: 0

sparklens

Qubole Sparklens tool for performance tuning Apache Spark

Language: Scala, License: Apache-2.0, Stargazers: 0, Issues: 0
Language: Python, Stargazers: 0, Issues: 0

Classification-Pyspark

A repository of classification templates using PySpark.

Language: Jupyter Notebook, Stargazers: 0, Issues: 0

diesel

A safe, extensible ORM and Query Builder for Rust

Language: Rust, License: Apache-2.0, Stargazers: 0, Issues: 0

redis-rs

Redis library for Rust

Language: Rust, License: NOASSERTION, Stargazers: 0, Issues: 0
Language: Jupyter Notebook, Stargazers: 0, Issues: 0

feature-engineering-for-ml-zh

[Translation] Feature Engineering for Machine Learning

Language: HTML, Stargazers: 0, Issues: 0

tispark

TiSpark is built for running Apache Spark on top of TiDB/TiKV

Language: Scala, License: Apache-2.0, Stargazers: 0, Issues: 0

blog

Everything about databases and business (mostly PostgreSQL).

Language: PLpgSQL, License: GPL-2.0, Stargazers: 1, Issues: 0

JavaGuide

[Java Study + Interview Guide] A guide covering most of the core knowledge that Java programmers need to master.

Language: Java, Stargazers: 0, Issues: 0

scala

The Scala programming language

Language: Scala, License: Apache-2.0, Stargazers: 0, Issues: 0

scala-best-practices

A collection of Scala best practices

Stargazers: 0, Issues: 0
Language: Jupyter Notebook, Stargazers: 0, Issues: 0

Optimus

Agile Data Science Workflows made easy with Python and Spark.

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0

opentsdb

A scalable, distributed Time Series Database.

Language: Java, License: LGPL-2.1, Stargazers: 0, Issues: 0

spark

Mirror of Apache Spark

Language: Scala, License: Apache-2.0, Stargazers: 0, Issues: 0

tensorflow-without-a-phd

A crash course in six episodes for software developers who want to become machine learning practitioners.

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 0, Issues: 0

DeepLearning-500-questions

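Deep Learning 500 Questions: explains frequently asked topics in probability, linear algebra, machine learning, deep learning, computer vision, and other popular areas in question-and-answer form, to help the author and any readers who need it. The book has 15 chapters and nearly 200,000 characters. Given the author's limited expertise, readers are kindly asked to point out any errors. To be continued. For collaboration, contact scutjy2015@163.com. All rights reserved; infringement will be pursued. Tan, 2018.06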

Stargazers: 0, Issues: 0

TensorFlowOnSpark

TensorFlowOnSpark brings TensorFlow programs onto Apache Spark clusters

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0

OryxML

OryxML is a realization of the lambda architecture based on Oryx 2, using Apache Spark and Apache Kafka for real-time large scale machine learning.

Language: Java, License: Apache-2.0, Stargazers: 0, Issues: 0

kubernetes-handbook

Kubernetes Chinese Guide / Cloud Native Application Architecture Practice Handbook - https://jimmysong.io/kubernetes-handbook

Language: Shell, License: Apache-2.0, Stargazers: 0, Issues: 0