Evan Sun (abzymeinsjtu)

Company: Alibaba Cloud

Location: Shanghai

Evan Sun's repositories

airflow

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

aws_notebook

aws notebook

Stargazers: 0 | Issues: 1 | Issues: 0

dask

Parallel computing with task scheduling

Language: Python | License: BSD-3-Clause | Stargazers: 0 | Issues: 0 | Issues: 0

dolphinscheduler

Apache DolphinScheduler is a distributed and extensible workflow scheduler platform with powerful DAG visual interfaces, dedicated to solving complex job dependencies in the data pipeline and providing various types of jobs out of the box.

Language: Java | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

dva-example-user-dashboard

👲 👬 👨‍👩‍👧 👨‍👩‍👦‍👦

Language: JavaScript | Stargazers: 0 | Issues: 1 | Issues: 0

elyra

Elyra extends JupyterLab Notebooks with an AI centric approach.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

enterprise_gateway

A lightweight, multi-tenant, scalable and secure gateway that enables Jupyter Notebooks to share resources across distributed clusters such as Apache Spark, Kubernetes and others.

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

hadoop-yarn-api-python-client

Python client for Hadoop® YARN API

Language: Python | License: BSD-3-Clause | Stargazers: 0 | Issues: 0 | Issues: 0

hudi

Upserts, Deletes And Incremental Processing on Big Data.

Language: Java | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

hue

Open source SQL Query Assistant service for Databases/Warehouses

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

incubator-livy

Mirror of Apache Livy (Incubating)

Language: Scala | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

incubator-seatunnel

SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).

Language: Java | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

ipython-sql

%%sql magic for IPython, hopefully evolving into full SQL client

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

jupyter_client

Jupyter protocol client APIs

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

jupyter_server

The backend (i.e. core services, APIs, and REST endpoints) to Jupyter web applications.

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

jupyterhub

Multi-user server for Jupyter notebooks

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

mage-ai

🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

python

Official Python client library for Kubernetes

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

scala

Scala 2 compiler and standard library. For bugs, see scala/bug

Language: Scala | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

skein

A tool and library for easily deploying applications on Apache YARN

Language: Python | License: BSD-3-Clause | Stargazers: 0 | Issues: 0 | Issues: 0

spark

Apache Spark - A unified analytics engine for large-scale data processing

Language: Scala | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Language: Java | Stargazers: 0 | Issues: 0 | Issues: 0

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

sudospawner

Spawn JupyterHub single-user servers with sudo

Language: Python | License: BSD-3-Clause | Stargazers: 0 | Issues: 0 | Issues: 0

superset

Apache Superset is a Data Visualization and Data Exploration Platform

Language: TypeScript | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

trino

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

Language: Java | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

watchdog

Python library and shell utilities to monitor filesystem events.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

xgboost

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on a single machine, Hadoop, Spark, Dask, Flink and DataFlow.

Language: C++ | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0