Chenyang Liu (chenliu0831)

chenliu0831

Geek Repo

Company:Bodylabs, Amazon, SageMaker

Github PK Tool:Github PK Tool

Chenyang Liu's starred repositories

superset

Apache Superset is a Data Visualization and Data Exploration Platform

Language:TypeScriptLicense:Apache-2.0Stargazers:59563Issues:1500Issues:10301

rustlings

:crab: Small exercises to get you used to reading and writing Rust code!

Language:RustLicense:MITStargazers:50094Issues:322Issues:594

ClickHouse

ClickHouse® is a real-time analytics DBMS

Language:C++License:Apache-2.0Stargazers:34746Issues:686Issues:19763

polars

Dataframes powered by a multithreaded, vectorized query engine, written in Rust

Language:RustLicense:NOASSERTIONStargazers:26728Issues:151Issues:7481

xstate

Actor-based state management & orchestration for complex app logic.

Language:TypeScriptLicense:MITStargazers:26279Issues:188Issues:1304

applied-ml

📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.

taichi

Productive, portable, and performant GPU programming in Python.

Language:C++License:Apache-2.0Stargazers:24863Issues:389Issues:2626

yjs

Shared data types for building collaborative software

Language:JavaScriptLicense:NOASSERTIONStargazers:15407Issues:120Issues:439

Signal-iOS

A private messenger for iOS.

Language:SwiftLicense:AGPL-3.0Stargazers:10531Issues:379Issues:3160

great_expectations

Always know what to expect from your data.

Language:PythonLicense:Apache-2.0Stargazers:9543Issues:82Issues:1825
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:9446Issues:95Issues:202

vaex

Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀

Language:PythonLicense:MITStargazers:8188Issues:143Issues:1208

deeplake

Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai

Language:PythonLicense:MPL-2.0Stargazers:7772Issues:86Issues:450

lux

Automatically visualize your pandas dataframe via a single print! 📊 💡

Language:PythonLicense:Apache-2.0Stargazers:5034Issues:89Issues:234

AugLy

A data augmentations library for audio, image, text, and video.

Language:PythonLicense:NOASSERTIONStargazers:4908Issues:71Issues:74

tf-quant-finance

High-performance TensorFlow library for quantitative finance.

Language:PythonLicense:Apache-2.0Stargazers:4320Issues:166Issues:54

git-branchless

High-velocity, monorepo-scale workflow for Git

Language:RustLicense:Apache-2.0Stargazers:3331Issues:22Issues:213
Language:PythonLicense:Apache-2.0Stargazers:1325Issues:37Issues:53

PyFlow

An open-source tool for visual and modular block programming in python

Language:PythonLicense:GPL-3.0Stargazers:1249Issues:22Issues:184

data

A PyTorch repo for data loading and utilities to be shared by the PyTorch domain libraries.

Language:PythonLicense:BSD-3-ClauseStargazers:1070Issues:35Issues:475

redwood

A highly-configurable, distributed, realtime database that manages a state tree shared among many peers.

Language:GoLicense:MITStargazers:847Issues:21Issues:129

blaze

Blazing-fast query execution engine speaks Apache Spark language and has Arrow-DataFusion at its core.

Language:RustLicense:Apache-2.0Stargazers:812Issues:23Issues:65

spark-rapids

Spark RAPIDS plugin - accelerate Apache Spark with GPUs

Language:ScalaLicense:Apache-2.0Stargazers:730Issues:42Issues:4875

snowflake-connector-python

Snowflake Connector for Python

Language:PythonLicense:Apache-2.0Stargazers:556Issues:39Issues:666

syne-tune

Large scale and asynchronous Hyperparameter and Architecture Optimization at your fingertips.

Language:PythonLicense:Apache-2.0Stargazers:366Issues:12Issues:120

regular-table

A regular <table> library, for async and virtual data models.

Language:JavaScriptLicense:Apache-2.0Stargazers:283Issues:0Issues:0

deltacat

A portable Pythonic Data Catalog API powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to your big data workloads.

Language:PythonLicense:Apache-2.0Stargazers:98Issues:9Issues:90

lux-widget

Jupyter Widget for Lux

Language:HTMLLicense:Apache-2.0Stargazers:72Issues:7Issues:26

recoreco

Fast item-to-item recommendations on the command line.

Language:RustLicense:GPL-3.0Stargazers:35Issues:3Issues:2

dirty_cat

Machine learning on dirty tabular data (legacy clone of skrub)

Language:PythonLicense:BSD-3-ClauseStargazers:9Issues:0Issues:1