Kenia Narsus's starred repositories

terraform

Terraform enables you to safely and predictably create, change, and improve infrastructure. It is a source-available tool that codifies APIs into declarative configuration files that can be shared amongst team members, treated as code, edited, reviewed, and versioned.

Language: Go | License: NOASSERTION | Stargazers: 41749 | Issues: 1169 | Issues: 20864

run

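The official designated GitHub repository for "Run-ology" (润学, the study of emigrating) worldwide. It collects the purpose, principles, theory, and real-world examples of "running", and addresses three questions: why to run, where to run, and how to run. It aims to become the core religion and core belief of the new ** people.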

skywalking

APM, Application Performance Monitoring System

Language: Java | License: Apache-2.0 | Stargazers: 23542 | Issues: 840 | Issues: 5255

dataease

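🔥 An open-source data visualization and analysis tool that anyone can use; an open-source alternative to commercial BI tools such as FanRuan and Tableau.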

Language: Java | License: GPL-3.0 | Stargazers: 16410 | Issues: 153 | Issues: 4403

nomad

Nomad is an easy-to-use, flexible, and performant workload orchestrator that can deploy a mix of microservice, batch, containerized, and non-containerized applications. Nomad is easy to operate and scale and has native Consul and Vault integrations.

Language: Go | License: NOASSERTION | Stargazers: 14604 | Issues: 537 | Issues: 6826

soar

SQL Optimizer And Rewriter

Language: Go | License: Apache-2.0 | Stargazers: 8631 | Issues: 279 | Issues: 237

datafusion

Apache DataFusion SQL Query Engine

Language: Rust | License: Apache-2.0 | Stargazers: 5602 | Issues: 105 | Issues: 4669

volcano

A Cloud Native Batch System (Project under CNCF)

Language: Go | License: Apache-2.0 | Stargazers: 3941 | Issues: 90 | Issues: 1494

paimon

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.

Language: Java | License: Apache-2.0 | Stargazers: 2165 | Issues: 75 | Issues: 1023

SREWorks

Cloud Native DataOps & AIOps Platform

Language: Java | License: Apache-2.0 | Stargazers: 1741 | Issues: 53 | Issues: 60

marquez

Collect, aggregate, and visualize a data ecosystem's metadata

Language: Java | License: Apache-2.0 | Stargazers: 1693 | Issues: 47 | Issues: 770

apache-spark-internals

The Internals of Apache Spark

koordinator

A QoS-based scheduling system that brings optimal layout and status to workloads such as microservices, web services, big data jobs, and AI jobs.

Language: Go | License: Apache-2.0 | Stargazers: 1266 | Issues: 28 | Issues: 554

loggie

A lightweight, cloud-native data transfer agent and aggregator

Language: Go | License: Apache-2.0 | Stargazers: 1219 | Issues: 24 | Issues: 219

incubator-gluten

Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.

Language: Scala | License: Apache-2.0 | Stargazers: 1082 | Issues: 40 | Issues: 1848

blaze

A blazing-fast query execution engine that speaks the Apache Spark language and has Arrow-DataFusion at its core.

Language: Rust | License: Apache-2.0 | Stargazers: 1004 | Issues: 22 | Issues: 78

yunikorn-core

Apache YuniKorn Core

Language: Go | License: Apache-2.0 | Stargazers: 775 | Issues: 46 | Issues: 0

incubator-teaclave

Apache Teaclave (incubating) is an open source universal secure computing platform, making computation on privacy-sensitive data safe and simple.

Language: Rust | License: Apache-2.0 | Stargazers: 751 | Issues: 54 | Issues: 219

sparkMeasure

This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spark jobs. It focuses on easing the collection and examination of Spark metrics, making it a practical choice for both developers and data engineers.

Language: Scala | License: Apache-2.0 | Stargazers: 676 | Issues: 33 | Issues: 39

Miscellaneous

Includes notes on using Apache Spark in general, notes on using Spark for physics, how to run TPCDS on PySpark, how to create histograms with Spark, tools for performance testing CPUs, Jupyter notebook examples for Spark, and examples for Oracle and other DB systems.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 410 | Issues: 25 | Issues: 6

delight

A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.

Language: Scala | License: NOASSERTION | Stargazers: 341 | Issues: 16 | Issues: 13

gazelle_plugin

Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.

Language: Scala | License: Apache-2.0 | Stargazers: 256 | Issues: 19 | Issues: 550

flink-remote-shuffle

Remote Shuffle Service for Flink

Language: Java | License: Apache-2.0 | Stargazers: 190 | Issues: 10 | Issues: 53

k8s-spark-scheduler

A Kubernetes Scheduler Extender to provide gang scheduling support for Spark on Kubernetes

Language: Go | License: Apache-2.0 | Stargazers: 175 | Issues: 239 | Issues: 19

shuttle

Shuttle: Highly Available, High-Performance Remote Shuffle Service

Language: Java | License: Apache-2.0 | Stargazers: 152 | Issues: 6 | Issues: 6

yunikorn-k8shim

Apache YuniKorn K8shim

Language: Go | License: Apache-2.0 | Stargazers: 107 | Issues: 24 | Issues: 0

StarLake

A New Way of Data Lake

Language: Scala | License: Apache-2.0 | Stargazers: 48 | Issues: 5 | Issues: 0

kun-scheduler

A workflow scheduler that understands both your data and metadata.

Language: Java | License: NOASSERTION | Stargazers: 27 | Issues: 6 | Issues: 159

sql-calculator

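A toolset built on the TiDB MySQL syntax parser. It supports: 1. SQL fingerprinting (sql fingerprint); 2. database schema comparison (sql diff), which compares the databases and tables of two databases and generates the corresponding difference (DDL) statements from the source database to the target database.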

Language: Go | License: Apache-2.0 | Stargazers: 21 | Issues: 1 | Issues: 2

spark-ui-reverse-proxy

This project provides a reverse proxy for Spark UI on Kubernetes

Language: Go | License: Apache-2.0 | Stargazers: 13 | Issues: 1 | Issues: 0