Yann Byron's repositories

incubator-paimon

Apache Paimon(incubating) is a streaming data lake platform that supports high-speed data ingestion, change data tracking and efficient real-time analytics.

License:Apache-2.0Stargazers:0Issues:0Issues:0

datafusion

Apache DataFusion SQL Query Engine

License:Apache-2.0Stargazers:0Issues:0Issues:0

datafusion-comet

Apache DataFusion Comet Spark Accelerator

License:Apache-2.0Stargazers:0Issues:0Issues:0

lance

Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, with more integrations coming..

License:Apache-2.0Stargazers:0Issues:0Issues:0

hudi

Upserts, Deletes And Incremental Processing on Big Data.

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

spark

Mirror of Apache Spark

Language:ScalaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

delta

An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.

License:Apache-2.0Stargazers:0Issues:0Issues:0

incubator-celeborn

Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.

License:Apache-2.0Stargazers:0Issues:0Issues:0

velox

A C++ vectorized database acceleration library aimed to optimizing query engines and data processing systems.

License:Apache-2.0Stargazers:1Issues:0Issues:0

arctic

Arctic is a streaming lake warehouse service open sourced by NetEase

License:Apache-2.0Stargazers:0Issues:0Issues:0

connectors

Connectors for Delta Lake

Language:ScalaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

incubator-toree

Mirror of Apache Toree (Incubating)

License:Apache-2.0Stargazers:0Issues:0Issues:0

hyperspace

An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.

License:Apache-2.0Stargazers:0Issues:0Issues:0

coder2gwy

互联网首份程序员考公指南,由3位已经进入体制内的前大厂程序员联合献上。

Stargazers:0Issues:0Issues:0

flink

Apache Flink

License:Apache-2.0Stargazers:0Issues:0Issues:0

flink-learning

flink learning blog. http://www.54tianzhisheng.cn 含 Flink 入门、概念、原理、实战、性能调优、源码解析等内容。涉及 Flink Connector、Metrics、Library、DataStream API、Table API & SQL 等内容的学习案例,还有 Flink 落地应用的大型项目案例(PVUV、日志存储、百亿数据实时去重、监控告警)分享。欢迎大家支持我的专栏《大数据实时计算引擎 Flink 实战与性能优化》

License:Apache-2.0Stargazers:0Issues:0Issues:0

simple-rpc

A simple rpc framework.

Language:JavaStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

xgboost

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Flink and DataFlow

Language:C++License:NOASSERTIONStargazers:0Issues:0Issues:0

dr-elephant

Performance monitoring and tuning tool for Apache Hadoop

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

git

Git Source Code Mirror - This is a publish-only repository and all pull requests are ignored. Please follow Documentation/SubmittingPatches procedure for any of your improvements.

Language:CLicense:NOASSERTIONStargazers:0Issues:0Issues:0

prog-scala-2nd-ed-code-examples

The code examples used in Programming Scala, 2nd Edition (O'Reilly)

Language:ScalaLicense:NOASSERTIONStargazers:0Issues:0Issues:0

CoolplaySpark

酷玩 Spark: Spark 源代码解析、Spark 类库等

Language:ScalaStargazers:0Issues:0Issues:0

hbase-rdd

Spark RDD to read and write from HBase

Language:ScalaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

framework

Lift Framework

Language:ScalaLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:HTMLLicense:NOASSERTIONStargazers:0Issues:0Issues:0

scikit-learn

scikit-learn: machine learning in Python

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

MyNotes

Self-written notes that may be useful

Language:PythonStargazers:0Issues:0Issues:0

data-algorithms-book

MapReduce and Spark Source Code and Scripts for Data Algorithms Book

Language:JavaLicense:NOASSERTIONStargazers:0Issues:0Issues:0