Wenchen Fan (cloud-fan)

Company: Databricks

Location: Hangzhou, China

Wenchen Fan's starred repositories

ddia-references

Literature references for “Designing Data-Intensive Applications”

Stargazers: 5697 | Issues: 0

mlflow

Open source platform for the machine learning lifecycle

Language: Python | License: Apache-2.0 | Stargazers: 18035 | Issues: 0

koalas

Koalas: pandas API on Apache Spark

Language: Python | License: Apache-2.0 | Stargazers: 3326 | Issues: 0

delta

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Language: Scala | License: Apache-2.0 | Stargazers: 7317 | Issues: 0

arrow

Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing

Language: C++ | License: Apache-2.0 | Stargazers: 14012 | Issues: 0

perfj

PerfJ is a wrapper around Linux perf for Java programs.

Language: C | License: GPL-2.0 | Stargazers: 352 | Issues: 0

CMAK

CMAK is a tool for managing Apache Kafka clusters

Language: Scala | License: Apache-2.0 | Stargazers: 11764 | Issues: 0

json4s

JSON library

Language: Scala | License: Apache-2.0 | Stargazers: 1477 | Issues: 0

scala-graph

Graph for Scala is intended to provide basic graph functionality that fits seamlessly into the Scala Collection Library. Like the well-known members of scala.collection, Graph for Scala is an in-memory graph library aimed at editing and traversing graphs, finding cycles, etc. in a user-friendly way.

Language: Scala | License: Apache-2.0 | Stargazers: 560 | Issues: 0

free-programming-books

Freely available programming books

License: CC-BY-4.0 | Stargazers: 330135 | Issues: 0

db-readings

Readings in Databases

Stargazers: 7576 | Issues: 0

SparkInternals

Notes on the design and implementation of Apache Spark

Stargazers: 5243 | Issues: 0

echarts

Apache ECharts is a powerful, interactive charting and data visualization library for the browser

Language: TypeScript | License: Apache-2.0 | Stargazers: 59793 | Issues: 0

shapeless

Generic programming for Scala

Language: Scala | License: Apache-2.0 | Stargazers: 3375 | Issues: 0

spark

Apache Spark - A unified analytics engine for large-scale data processing

Language: Scala | License: Apache-2.0 | Stargazers: 39015 | Issues: 0