Chengzhi Zhao (ChengzhiZhao)

ChengzhiZhao

Geek Repo

Location:United States

Github PK Tool:Github PK Tool

Chengzhi Zhao's starred repositories

awesome

😎 Awesome lists about all kinds of interesting topics

coding-interview-university

A complete computer science study plan to become a software engineer.

fucking-algorithm

刷算法全靠套路,认准 labuladong 就够了!English version supported! Crack LeetCode, not only how, but also why.

metabase

The simplest, fastest way to get business intelligence and analytics to everyone in your company :yum:

Language:ClojureLicense:NOASSERTIONStargazers:37749Issues:641Issues:19580

polars

Dataframes powered by a multithreaded, vectorized query engine, written in Rust

Language:RustLicense:NOASSERTIONStargazers:28668Issues:160Issues:8374

awesome-datascience

:memo: An awesome Data Science repository to learn and apply for real world problems.

duckdb

DuckDB is an analytical in-process SQL database management system

vector

A high-performance observability data pipeline.

Language:RustLicense:MPL-2.0Stargazers:17274Issues:152Issues:7505

BigData-Notes

大数据入门指南 :star:

prefect

Prefect is a workflow orchestration framework for building resilient data pipelines in Python.

Language:PythonLicense:Apache-2.0Stargazers:15625Issues:162Issues:5337

arrow

Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing

Language:C++License:Apache-2.0Stargazers:14101Issues:354Issues:25399

dask

Parallel computing with task scheduling

Language:PythonLicense:BSD-3-ClauseStargazers:12330Issues:211Issues:5127

minimal-mistakes

:triangular_ruler: Jekyll theme for building a personal site, blog, project documentation, or portfolio.

dagster

An orchestration platform for the development, production, and observation of data assets.

Language:PythonLicense:Apache-2.0Stargazers:10973Issues:118Issues:7163

kedro

Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.

Language:PythonLicense:Apache-2.0Stargazers:9751Issues:107Issues:1901

God-Of-BigData

专注大数据学习面试,大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive...

PaddleGAN

PaddlePaddle GAN library, including lots of interesting applications like First-Order motion transfer, Wav2Lip, picture repair, image editing, photo2cartoon, image style transfer, GPEN, and so on.

Language:PythonLicense:Apache-2.0Stargazers:7812Issues:108Issues:357

obsidian-dataview

A data index and query language over Markdown files, for https://obsidian.md/.

Language:TypeScriptLicense:MITStargazers:6728Issues:40Issues:1315

tenacity

Retrying library for Python

Language:PythonLicense:Apache-2.0Stargazers:6438Issues:47Issues:260

docker-spark

Apache Spark docker image

blazingsql

BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF.

Language:C++License:Apache-2.0Stargazers:1923Issues:55Issues:715

2D-Character-Controller

Free 2D Character Controller for Unity.

hop

Hop Orchestration Platform

Language:JavaLicense:Apache-2.0Stargazers:912Issues:47Issues:1456

cutecharts.py

📉 Hand drawing style charts library for Python

Language:PythonLicense:MITStargazers:752Issues:19Issues:11

networkD3

D3 JavaScript Network Graphs from R

nbpreview

Render Jupyter/IPython notebooks without running a notebook server.

Language:CSSLicense:MITStargazers:287Issues:11Issues:7

superglue

Superglue is a lineage-tracking tool built to help visualize the propagation of data through complex pipelines composed of tables, jobs and reports.

Language:ScalaLicense:Apache-2.0Stargazers:154Issues:12Issues:17

sherlock

Sherlock is an anomaly detection service built on top of Druid

Language:JavaLicense:NOASSERTIONStargazers:150Issues:16Issues:25

Flink-Forward-Asia-2019

Flink Forward Asia 2019 PPT以及视频资料

spark-structured-streaming-jdbc-sink

Spark Structured Streaming JDBC Sink

Language:ScalaLicense:Apache-2.0Stargazers:16Issues:3Issues:2